Stars
A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)
[ICCV 2025] MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
Code for Streaming 4D Visual Geometry Transformer
Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".
[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers
Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
DeepVerse: 4D Autoregressive Video Generation as a World Model
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
Build a Jekyll blog in minutes, without touching the command line.
Official project page of MTVCrafter, a new paradigm for animating arbitrary characters with 4D motion tokens.
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
[NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
[ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory
[ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".
Official Implementation of [AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models]
The implementation of Extreme Viewpoint 4D Video Generation
[ICCV 2025] Official code for AnimateAnyMesh: A Feed-Forward 4D Foundation Model for Text-Driven Universal Mesh Animation
cjeen / LoRAEdit
Forked from tdrussell/diffusion-pipeWe achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additional reference conditions.
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
[NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model
[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"
[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
[Official Code] Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction
A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]