Highlights
- Pro
Stars
VGGT 3D Vision Agent optimized for Apple Silicon with Metal Performance Shaders
Reference PyTorch implementation and models for DINOv3
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
[NeurIPS 2025] The official repository of "Sekai: A Video Dataset towards World Exploration"
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
Repository for running the VGGT model in PyTorch
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Web-based 3D visualization + Python
[CVPR'25 Highlight] You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
[ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew Zisserman
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
[ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
[ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman
[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models
Gaussian Splatting from VGGSfM and Mast3r, and their comparison
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!