Stars
ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation
ICRA 2018 "Sparse-to-Dense: Depth Prediction from Sparse Depth Samples and a Single Image" (Torch Implementation)
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
PyTorch implementation of Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network (arXiv:1609.04802)
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.
Open source code for Paper: Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions
Autonomous GPU Kernel Generation via Deep Agents
Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training".
🐻 Uniform Discrete Diffusion with Metric Path for Video Generation
A framework to convert any 2D videos to immersive stereoscopic 3D
Web-based 3D visualization + Python
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
[CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
STGAN: A Unified Selective Transfer Network for Arbitrary Image Attribute Editing
From Big to Small: Multi-Scale Local Planar Guidance for Monocular Depth Estimation
Monocular Depth Estimation Toolbox based on MMSegmentation.
Depth Any Video with Scalable Synthetic Data (ICLR 2025)
Teravus / Chunk_E2FGVI
Forked from MCG-NKU/E2FGVI
Arbitrary Length Video Using Frame Chunking. Based on the official code repository for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR 2022)
[CVPR 2021] Self-supervised depth estimation from short sequences
Self-supervised temporally consistent depth estimation
This repo contains the projects 'Virtual Normal', 'DiverseDepth', and '3D Scene Shape'. They address monocular depth estimation and 3D scene reconstruction from a single image.
Enforcing Temporal Consistency in Video Depth Estimation, ICCV-W 2021.