Starred repositories
An LLM agent that conducts deep research (local and web) on any given topic and generates a long report with citations.
A beautiful, simple, clean, and responsive Jekyll theme for academics
Official implementation of AnimateDiff.
Open-Sora: Democratizing Efficient Video Production for All
A vision-language model for recognizing surgical objects in surgical images and videos.
awesome papers in LLM interpretability
NaturalProofs: Mathematical Theorem Proving in Natural Language (NeurIPS 2021 Datasets & Benchmarks)
An evaluation dataset comprising of 274 grid-based puzzles with different complexities
Official Repository for ICLR 2025 (Oral) BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
A high-throughput and memory-efficient inference and serving engine for LLMs
Convert PDF to markdown + JSON quickly with high accuracy
This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment" and our MICCAI 2023 paper "Encoding Surgical Videos as Spa…
Official repo for the paper "Bilinear MLPs enable weight-based mechanistic interpretability".
A virtual environment for developing and evaluating automated scientific discovery agents.
GitHub Copilot extension for JupyterLab
Generating figures from research papers, using textual captions from the paper.
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks
Official Repository for the Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment
A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrument, verb, target> labels for every surgical fine-grained act…
[MedIA'25] Learning multi-modal representations by watching hundreds of surgical video lectures