Stars
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
StreamDiffusion, Live Stream APP
[NeurIPS 2025]: CPO: Condition Preference Optimization for Controllable Image Generation
Officail Implementation for "Text to Sketch Generation with Multi-Styles" [NeurIPS 2025]
Offical code for "FastGS: Training 3D Gaussian Splatting in 100 Seconds"
[NeurIPS 2025 Oral]Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
Lumos-Custom Project: research for customized video generation in the Lumos Project.
NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation
The official repository for the paper "ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning"
Transform your 3D texturing workflow with the power of generative AI, directly within Blender!
Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".
[NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
[NeurIPS'25] Official repository of Concerto: Joint 2D-3D Self-Supervised Learning Emerges Spatial Representations
Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives
Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"
[CVPR 2025] The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation
LightMem: Lightweight and Efficient Memory-Augmented Generation
Code implementation of the paper "World-in-World: World Models in a Closed-Loop World"
[AAAI 2026] Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback
Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"
ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation
Official implementation for the NeurIPS 2025 paper: "FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models". A lightweight, training-free framework for modul…
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Official implementation of "DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training".