Stars
ScaleCUA is the open-sourced computer use agents that can operate on corss-platform environments (Windows, macOS, Ubuntu, Android).
A Paper List for Humanoid Robot Learning.
[arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.
[CoRL 2025] TWIST: Teleoperated Whole-Body Imitation System
[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
[ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE
[ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation
A unified framework for 3D content generation.
Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
[CVPR 2025 Highlight] DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
[AAAI 2025] Follow-Your-Canvas: This repo is the official implementation of "Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation"
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
A comprehensive collection of IQA papers
[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
[Siggraph Asia 2024 & IJCV 2025] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
[ECCV 2024] Noise Calibration: Plug-and-play Content-Preserving Video Enhancement using Pre-trained Video Diffusion Models
Collection of recent shadow removal works, including papers, codes, datasets, and metrics.
Code for FreeTraj, a tuning-free method for trajectory-controllable video generation
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
An innovative method designed to augment the capabilities of existing video diffusion models