Skip to content
View ziqihuangg's full-sized avatar

Block or report ziqihuangg

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

the Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Python 62 3 Updated Jan 4, 2026

MatAnyone 2: Scaling Video Matting via a Learned Quality Evaluator

100 1 Updated Dec 15, 2025

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 170 14 Updated Dec 19, 2025

This is a collection of recent papers on reasoning in video generation models.

93 2 Updated Jan 8, 2026

🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.

128 5 Updated Jan 12, 2026
Python 30 1 Updated Dec 17, 2025
Python 21 1 Updated Dec 10, 2025

Code for CineScale, higher-resolution video generation based on Wan

Python 182 2 Updated Aug 25, 2025

[ICIP2025 Spotlight] Efficient and High-Fidelity Image Generation

JavaScript 2 1 Updated Jan 12, 2026

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,605 1,612 Updated Dec 17, 2025

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Python 89 3 Updated Sep 12, 2025

A list of works on video generation towards world model

327 6 Updated Jan 14, 2026

Lets make video diffusion practical!

Python 16,512 1,623 Updated Oct 16, 2025

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)

Python 52 2 Updated Jan 14, 2026

[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation

Python 1,458 91 Updated Nov 2, 2025

A Python package that makes it easy for developers to create AI apps powered by various AI providers.

Python 1,647 209 Updated Apr 8, 2025

[ACL2025 Oral & Award] Evaluate Image/Video Generation like Humans - Fast, Explainable, Flexible

Python 114 5 Updated Aug 10, 2025

Understand Human Behavior to Align True Needs

Python 4,052 397 Updated Aug 13, 2025

Implementation of P+: Extended Textual Conditioning in Text-to-Image Generation

Python 49 1 Updated Mar 26, 2023

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 11,115 1,090 Updated Nov 18, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 2,250 94 Updated Feb 16, 2025

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

402 19 Updated Sep 22, 2025

[CSUR] A Survey on Video Diffusion Models

2,255 111 Updated Jun 27, 2025

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 6,450 724 Updated Mar 19, 2025

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Python 539 21 Updated Jan 18, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 935 43 Updated Sep 27, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 1,422 97 Updated Jan 9, 2026

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Jupyter Notebook 3,003 202 Updated Mar 9, 2024

FreeU: Free Lunch in Diffusion U-Net (CVPR2024 Oral)

1,894 77 Updated Dec 24, 2024

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 21,534 2,926 Updated Jan 13, 2026
Next