Highlights
- Pro
Stars
[ICCV 2025] MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion
MP-SfM: Monocular Surface Priors for Robust Structure-from-Motion (CVPR 2025)
ComfyUI node that adds support for FLF2V with the Wan2.2 VAE (Wan2.2 5B)
Community trainer for Lightricks' LTX Video model 🎬 ⚡️
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
The official implementation of ICCV'25 paper "FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution"
DOC-Depth: A novel approach for dense depth ground truth generation
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Ray tracing and hybrid rasterization of Gaussian particles
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
CUDA accelerated rasterization of gaussian splatting
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
This is a multiview car images dataset created without permission.
[AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching
Unofficial implementation of "UniSim: A Neural Closed-Loop Sensor Simulator".
An extremely fast Python package and project manager, written in Rust.
NeuroNCAP benchmark for end-to-end autonomous driving
A beautiful, simple, clean, and responsive Jekyll theme for academics
This is the official implementation of the ECCV 2022 paper Object Detection as Probabilistic Set Prediction
[CVPR2024] NeuRAD: Neural Rendering for Autonomous Driving
Software Development Kit for the Zenseact Open Dataset (ZOD)
An emoji guide for your commit messages. 😜
Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)