Skip to content
View ziqihuangg's full-sized avatar

Block or report ziqihuangg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

the Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Python 35 Updated Jan 4, 2026

MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator

93 1 Updated Dec 15, 2025

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 169 14 Updated Dec 19, 2025

This is a collection of recent papers on reasoning in video generation models.

91 2 Updated Jan 8, 2026

🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

125 5 Updated Dec 30, 2025
Python 29 1 Updated Dec 17, 2025
Python 21 1 Updated Dec 10, 2025

Code for CineScale, higher-resolution video generation based on Wan

Python 182 2 Updated Aug 25, 2025

[ICIP2025 Spotlight] Efficient and High-Fidelity Image Generation

Python 2 1 Updated Aug 20, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,450 1,593 Updated Dec 17, 2025

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 88 3 Updated Sep 12, 2025

A list of works on video generation towards world model

320 6 Updated Jan 5, 2026

Lets make video diffusion practical!

Python 16,490 1,612 Updated Oct 16, 2025

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)

Python 52 2 Updated Sep 21, 2025

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,445 91 Updated Nov 2, 2025

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,648 209 Updated Apr 8, 2025

[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible

Python 114 5 Updated Aug 10, 2025

Understand Human Behavior to Align True Needs

Python 4,052 396 Updated Aug 13, 2025

Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation

Python 49 1 Updated Mar 26, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,110 1,091 Updated Nov 18, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,248 94 Updated Feb 16, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

401 19 Updated Sep 22, 2025

[CSUR] A Survey on Video Diffusion Models

2,251 111 Updated Jun 27, 2025

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 6,440 721 Updated Mar 19, 2025

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 538 21 Updated Jan 18, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 935 43 Updated Sep 27, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,411 97 Updated Jan 9, 2026

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 3,002 202 Updated Mar 9, 2024

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

1,893 77 Updated Dec 24, 2024

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 21,449 2,913 Updated Jan 6, 2026
Next