Highlights
- Pro
Stars
[CVPR 2025 Oral & Award Candidate] Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models
[ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes
[CVPR2025] Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction
WorldSplat: Gaussian-Centric Feed-Forward 4D Scene Generation for Autonomous Driving
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
[CVPR'25] DepthSplat: Connecting Gaussian Splatting and Depth
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
[NeurIPS 2025] ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
Dynamic 3D Foundation Model using Causal Transformer
[ICCV 2025] Driving Scene Synthesis on Free-form Trajectories with Generative Prior
[CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
[ICCV2025] ExploreGS: Explorable 3D Scene Reconstruction with Virtual Camera Samplings and Diffusion Priors
[ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Scalable and Generalizable Autonomous Driving Scene Synthesis
[ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation
Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)
Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"
The official repo for "LongDWM: Cross-Granularity Distillation for Building a Long-Term Driving World Model"
official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction"
A collection of papers about domain adaptation for 3D object detection. Welcome to PR the works (papers, repositories) that are missed by the repo.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
[ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes"