This is the first paper to explore how to effectively use R1-like RL for MLLMs and introduce Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reas…

Python 728 19 Updated Sep 10, 2025

Visual-AI / 3DRS

[NeurIPS 2025] MLLMs Need 3D-Aware Representation Supervision for Scene Understanding

Python 115 Updated Nov 6, 2025

SpatialVision / Prior-Depth-Anything

Python 434 35 Updated Sep 2, 2025

NVIDIA / cuda-python

CUDA Python: Performance meets Productivity

Python 3,044 223 Updated Nov 22, 2025

swc-17 / SparseDrive

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Python 768 105 Updated Mar 17, 2025

nv-tlabs / Difix3D

[CVPR 2025 Oral & Best Paper Finalist] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Python 919 68 Updated Jun 28, 2025

rolpotamias / WiLoR

WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild

Python 401 31 Updated Aug 1, 2025

gorkaydemir / track_on

[ICLR 2025] Track-On: Transformer-based Online Point Tracking with Memory, and [arXiv 2025] Track-On2: Enhancing Online Point Tracking with Memory

Python 81 5 Updated Oct 17, 2025

CUT3R / CUT3R

Official implementation of Continuous 3D Perception Model with Persistent State

Python 1,204 66 Updated Aug 27, 2025

alibaba-damo-academy / Uni3C

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [Siggraph Asian 2025]

Python 438 23 Updated Sep 21, 2025

zbw001 / TAPIP3D

TAPIP3D: Tracking Any Point in Persistent 3D Geometry

Python 326 22 Updated Sep 27, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 16,219 1,562 Updated Oct 16, 2025

lpiccinelli-eth / UniDepth

Universal Monocular Metric Depth Estimation

Python 1,073 100 Updated May 18, 2025

nicolasugrinovic / multiphys

Code for the paper MultiPhys: Multi-Person Physics-aware 3D Motion Estimation (CVPR 2024)

Python 77 4 Updated Mar 24, 2025

ttxskk

Highlights

Starred repositories

Awesome Lists