Skip to content
View haoyi-duan's full-sized avatar
🤖
🤖

Highlights

  • Pro

Block or report haoyi-duan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 6 Updated Jul 28, 2025

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 845 37 Updated Oct 28, 2025

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

543 35 Updated Nov 11, 2025

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 155 4 Updated Oct 21, 2025

This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Python 90 1 Updated Apr 9, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,733 81 Updated Sep 8, 2025

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 459 18 Updated Mar 19, 2025

Enjoy the magic of Diffusion models!

Python 10,650 994 Updated Nov 10, 2025

[NeurIPS 2025] Official code for Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos

Python 72 3 Updated Oct 24, 2025

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,847 190 Updated Nov 3, 2025

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,362 470 Updated Aug 7, 2024

[SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models"

Python 113 11 Updated Oct 27, 2025

[arXiv 2025] VisualMimic: Visual Humanoid Loco-Manipulation via Motion Tracking and Generation

Python 228 3 Updated Oct 3, 2025

MetaDrive: Lightweight driving simulator for everyone

Python 1,038 163 Updated Aug 15, 2025

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Makefile 101 24 Updated Feb 1, 2024

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python 767 33 Updated Jul 2, 2025

[arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.

Python 1,132 154 Updated Nov 8, 2025

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Python 186 28 Updated Sep 21, 2022

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 364 14 Updated May 30, 2025

A pipeline parallel training script for diffusion models.

Python 1,702 231 Updated Nov 7, 2025

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5,883 721 Updated Aug 16, 2024

Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.

Python 114 4 Updated Jun 13, 2024

Video-P2P: Video Editing with Cross-attention Control

Python 422 26 Updated Jun 30, 2025

MagicEdit: High-Fidelity Temporally Coherent Video Editing

1,805 102 Updated Aug 29, 2023

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 3,000 202 Updated Mar 9, 2024

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,866 381 Updated Apr 7, 2024

Make self forcing endless. Add cache purging. Add prompt controllability.

Python 65 2 Updated Sep 9, 2025

Tooling for the Common Objects In 3D dataset.

Python 1,087 85 Updated Aug 14, 2024
Next