Organizations

@THUDM

Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives

Python 499 82 Updated Nov 8, 2025

Kaleido: Open-source multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple image references.

Python 48 3 Updated Nov 25, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,870 206 Updated Sep 12, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 39,996 6,940 Updated Nov 25, 2025

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

Python 300 19 Updated May 17, 2025
Python 20 1 Updated Jul 20, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,058 58 Updated Aug 7, 2025

Concat-ID: Towards Universal Identity-Preserving Video Synthesis

Python 63 Updated May 7, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,338 175 Updated Mar 13, 2025

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 335 9 Updated Mar 26, 2025

Official code for MotionBench (CVPR 2025)

Python 59 2 Updated Mar 3, 2025

[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Python 786 44 Updated Aug 30, 2025

Simple ControlNet module for the CogVideoX model.

Jupyter Notebook 173 11 Updated Jan 12, 2025

Helpful tools and examples for working with flex-attention

Python 1,060 64 Updated Nov 18, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,426 289 Updated Nov 19, 2025

[CVPR 2025 Highlight] 3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 1,013 39 Updated Jul 14, 2025

Next-Token Prediction is All You Need

Python 2,251 89 Updated Nov 19, 2025

Keyframe Interpolation with CogVideoX

Python 139 3 Updated Oct 31, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

430 23 Updated Mar 8, 2025

[CVPR'25] Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,213 57 Updated Jul 9, 2025

Scalable and memory-optimized training of diffusion models

Python 1,306 142 Updated Jun 4, 2025

CogView4, CogView3-Plus and CogView3 (ECCV 2024)

Python 1,093 78 Updated Mar 29, 2025

📹 A more flexible framework that can generate videos at any resolution and create videos from images.

Python 1,556 110 Updated Nov 25, 2025

Text-to-video and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,176 1,219 Updated Nov 4, 2025

PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)

Jupyter Notebook 527 31 Updated Sep 8, 2025

Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 560 31 Updated Sep 16, 2024

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Python 441 23 Updated Jul 5, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,466 76 Updated Feb 19, 2025