Stars
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Official Implementation of TETA metric from ECCV22 paper: Tracking Every Thing In The Wild
Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything
Universal Monocular Metric Depth Estimation
[ICCV23] Official Implementation of DARTH: Holistic Test-time Adaptation for Multiple Object Tracking
[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor.
[ICCV 2023] CTVIS: Consistent Training for Online Video Instance Segmentation
iDisc: Internal Discretization for Monocular Depth Estimation [CVPR 2023]
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
A Platform for Visual Learning from Human Feedback
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]
The official implementation for the CVPR 2023 paper Joint Visual Grounding and Tracking with Natural Language Specification.
Scalabel: A versatile web-based visual data annotation tool
Toolkit of BDD100K Dataset for Heterogeneous Multitask Learning - CVPR 2020 Oral Paper
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Unlock your displays on your Mac! Flexible HiDPI scaling, XDR/HDR extra brightness, virtual screens, DDC control, extra dimming, PIP/streaming, EDID override and lots more!
Official implementation of Dense Prediction with Attentive Feature Aggregation, WACV 2023
🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.
World of Warcraft addon that provides a powerful framework to display customizable graphics on your screen.
Implementation of Tracking Every Thing in the Wild, ECCV 2022