haoyi-duan

🤖

Haoyi Duan haoyi-duan

🤖

MS @ Stanford, B.Eng @ ZJU

38 followers · 8 following

Stanford University
Stanford, CA
21:00 (UTC -08:00)
haoyi-duan.github.io

Achievements

Highlights

Stars

morphicfilms / frames-to-video

Python 154 11 Updated Nov 8, 2025

yuhui-zh15 / MixedModalitySearch

Python 6 Updated Jul 28, 2025

timothybrooks / instruct-pix2pix

Python 6,833 576 Updated Mar 3, 2024

fallenshock / FlowEdit

Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models"

Python 845 37 Updated Oct 28, 2025

mayuelala / Awesome-Controllable-Video-Generation

[ArXiv 2025] A survey about controllable video generation: This repo is the official awesome of "Controllable video generation: A survey"

543 35 Updated Nov 11, 2025

Jiawei-Yang / DeTok

Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers"

Jupyter Notebook 155 4 Updated Oct 21, 2025

zibojia / SENORITA

This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Python 90 1 Updated Apr 9, 2025

stepfun-ai / Step1X-Edit

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,733 81 Updated Sep 8, 2025

LituRout / RF-Inversion

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 459 18 Updated Mar 19, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,650 994 Updated Nov 10, 2025

Kaihua-Chen / cog-nvs

[NeurIPS 2025] Official code for Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos

Python 72 3 Updated Oct 24, 2025

Paper2Poster / Paper2Poster

[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers

Python 2,847 190 Updated Nov 3, 2025

QwenLM / Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 6,362 470 Updated Aug 7, 2024

KIMGEONUNG / VideoFrom3D

[SIGGRAPH-ASIA 2025] Official implementation of "VideoFrom3D: 3D Scene Video Generation via Complementary Image and Video Diffusion Models"

Python 113 11 Updated Oct 27, 2025

visualmimic / VisualMimic

[arXiv 2025] VisualMimic: Visual Humanoid Loco-Manipulation via Motion Tracking and Generation

Python 228 3 Updated Oct 3, 2025

metadriverse / metadrive

MetaDrive: Lightweight driving simulator for everyone

Python 1,038 163 Updated Aug 15, 2025

yujun0-0 / MMA-Net

VIL-100: A New Dataset and A Baseline Model for Video Instance Lane Detection (ICCV 2021)

Makefile 101 24 Updated Feb 1, 2024

OpenDriveLab / DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving

Python 767 33 Updated Jul 2, 2025

YanjieZe / GMR

[arXiv 2025] GMR: General Motion Retargeting. Retarget human motions into diverse humanoid robots in real time on CPU. Retargeter for TWIST.

Python 1,132 154 Updated Nov 8, 2025

m-bain / CondensedMovies

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Python 186 28 Updated Sep 21, 2022

NJU-PCALab / OpenVid-1M

[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Python 364 14 Updated May 30, 2025

tdrussell / diffusion-pipe

A pipeline parallel training script for diffusion models.

Python 1,702 231 Updated Nov 7, 2025

VAST-AI-Research / TripoSR

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python 5,883 721 Updated Aug 16, 2024

aejion / 4Diffusion

Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.

Python 114 4 Updated Jun 13, 2024

dvlab-research / Video-P2P

Video-P2P: Video Editing with Cross-attention Control

Python 422 26 Updated Jun 30, 2025

magic-research / magic-edit

MagicEdit: High-Fidelity Temporally Coherent Video Editing

1,805 102 Updated Aug 29, 2023

williamyang1991 / Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 3,000 202 Updated Mar 9, 2024

ant-research / CoDeF

[CVPR'24 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Python 4,866 381 Updated Apr 7, 2024

Dere-Wah / Self-Forcing-Endless

Forked from guandeh17/Self-Forcing

Make self forcing endless. Add cache purging. Add prompt controllability.

Python 65 2 Updated Sep 9, 2025

facebookresearch / co3d

Tooling for the Common Objects In 3D dataset.

Python 1,087 85 Updated Aug 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Haoyi Duan haoyi-duan

Achievements

Achievements

Highlights

Block or report haoyi-duan

Stars

morphicfilms / frames-to-video

yuhui-zh15 / MixedModalitySearch

timothybrooks / instruct-pix2pix

fallenshock / FlowEdit

mayuelala / Awesome-Controllable-Video-Generation

Jiawei-Yang / DeTok

zibojia / SENORITA

stepfun-ai / Step1X-Edit

LituRout / RF-Inversion

modelscope / DiffSynth-Studio

Kaihua-Chen / cog-nvs

Paper2Poster / Paper2Poster

QwenLM / Qwen-VL

KIMGEONUNG / VideoFrom3D

visualmimic / VisualMimic

metadriverse / metadrive

yujun0-0 / MMA-Net

OpenDriveLab / DriveAGI

YanjieZe / GMR

m-bain / CondensedMovies

NJU-PCALab / OpenVid-1M

tdrussell / diffusion-pipe

VAST-AI-Research / TripoSR

aejion / 4Diffusion

dvlab-research / Video-P2P

magic-research / magic-edit

williamyang1991 / Rerender_A_Video

ant-research / CoDeF

Dere-Wah / Self-Forcing-Endless

facebookresearch / co3d