Skip to content
View ybbbbt's full-sized avatar

Highlights

  • Pro

Organizations

@zju3dv

Block or report ybbbbt

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Industry-level video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.

732 78 Updated Aug 27, 2025

[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Python 2,261 146 Updated Sep 19, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,611 78 Updated Oct 23, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,031 114 Updated Nov 12, 2025

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Python 1,191 106 Updated Oct 15, 2025

Lets make video diffusion practical!

Python 16,192 1,561 Updated Oct 16, 2025

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,443 152 Updated Apr 18, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,543 110 Updated Nov 17, 2025

[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling

Python 532 6 Updated Oct 26, 2025

ComfyUI Node

Python 672 39 Updated Jun 18, 2025

AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers

Python 138 11 Updated Sep 16, 2025

[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Python 788 39 Updated Aug 8, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,712 2,138 Updated Jul 17, 2025

Collection of scripts to build small-scale datasets for fine-tuning video generation models.

Python 70 7 Updated Mar 17, 2025

Lora traing script for Lightricks LTX-video

Python 67 4 Updated Feb 12, 2025

[NOTE] I do not have enough ressources to maintain VMS, please use Ostris's AI-Tookit instead

Python 41 2 Updated Oct 3, 2025

📄 Configuration files that enhance Cursor AI editor experience with custom rules and behaviors

MDX 35,481 3,016 Updated Oct 24, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 2,694 266 Updated Nov 6, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,507 369 Updated Nov 18, 2025

[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".

Python 436 26 Updated Aug 27, 2025

The ultimate training toolkit for finetuning diffusion models

Python 6,878 833 Updated Nov 17, 2025

Scalable and memory-optimized training of diffusion models

Python 1,301 141 Updated Jun 4, 2025

A pipeline parallel training script for diffusion models.

Python 1,718 231 Updated Nov 7, 2025

musubi-tuner modified to tune image2video/video infilling

Python 33 3 Updated Jan 30, 2025

Official repository for LTX-Video

Python 8,784 812 Updated Oct 25, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 17,281 1,897 Updated Oct 21, 2025

A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and professional-looking video outputs by incorporating iconic …

Python 45 3 Updated Dec 31, 2024

A general fine-tuning kit geared toward diffusion models.

Python 2,600 255 Updated Nov 18, 2025

Code for our paper: Learning Camera Movement Control from Real-World Drone Videos

Python 32 4 Updated Apr 16, 2025
Next