YinHan-Zhang

YinHan-Zhang

7 followers · 2 following

Achievements

Stars

HKUDS / VideoAgent

"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"

Python 267 39 Updated Oct 17, 2025

km1994 / AwesomeMultiModel

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享大语言模型（LLMs），大模型高效微调（SFT）,检索增强生成（RAG），智能体（Agent），PPT自动生成, 角色扮演，文生图（Stable Diffusion），图像文字识别（OCR），语音识别（ASR），语音合成（TTS），人像分割（SA），多模态（VLM），Ai 换脸(Face Swapping), 文生视频(VD)，图生…

39 4 Updated Apr 26, 2025

NVlabs / rcm

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 288 13 Updated Nov 5, 2025

facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,667 327 Updated Jan 21, 2025

hao-ai-lab / FastVideo

A unified inference and post-training framework for accelerated video generation.

Python 2,590 197 Updated Nov 16, 2025

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,056 52 Updated Mar 5, 2025

ostris / ai-toolkit

The ultimate training toolkit for finetuning diffusion models

Python 6,869 832 Updated Nov 10, 2025

bytedance / UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,324 77 Updated Sep 12, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,539 110 Updated Nov 13, 2025

TheDenk / wan2.1-dilated-controlnet

Controlnet module for Wan2.1

Python 25 1 Updated Aug 4, 2025

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 401 19 Updated Sep 9, 2024

FoundationVision / Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,496 82 Updated Nov 10, 2025

limuloo / MIGC

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

Python 614 29 Updated May 15, 2025

Haian-Jin / LVSM

[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"

Python 449 24 Updated Aug 4, 2025

muskie82 / MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,883 181 Updated Aug 7, 2024

KlingTeam / ReCamMaster

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,608 78 Updated Oct 23, 2025

hanyang-21 / VideoScene

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 325 9 Updated Jul 4, 2025

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 22,184 2,674 Updated Jun 12, 2025

YinHan-Zhang / MagicColor

The official implementation of "MagicColor: Multi-Instance Sketch Colorization"

Python 119 8 Updated Jun 30, 2025

KovenYu / WonderWorld

Code release for https://kovenyu.com/WonderWorld/

Python 670 33 Updated Apr 14, 2025

nv-tlabs / stmc

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Python 135 8 Updated Jun 18, 2024

milkymap / transformer-image-captioning

Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING

Python 31 6 Updated Jun 1, 2022

zarzouram / image_captioning_with_transformers

Pytorch implementation of image captioning using transformer-based model.

Jupyter Notebook 68 9 Updated Apr 13, 2023

OpenMotionLab / MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,795 130 Updated Jul 1, 2025

EricGuo5513 / momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 1,164 95 Updated Sep 13, 2024

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 23,290 3,457 Updated Nov 16, 2025

YaoFANGUK / video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,508 1,061 Updated Jun 26, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,305 5,726 Updated Sep 10, 2025

Sanster / IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,345 2,326 Updated Apr 29, 2025

antgroup / echomimic_v2

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,362 511 Updated Aug 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

YinHan-Zhang

Achievements

Achievements

Block or report YinHan-Zhang

Stars

HKUDS / VideoAgent

km1994 / AwesomeMultiModel

NVlabs / rcm

facebookresearch / co-tracker

hao-ai-lab / FastVideo

tianweiy / DMD2

ostris / ai-toolkit

bytedance / UNO

aigc-apps / VideoX-Fun

TheDenk / wan2.1-dilated-controlnet

feizc / DiT-MoE

FoundationVision / Infinity

limuloo / MIGC

Haian-Jin / LVSM

muskie82 / MonoGS

KlingTeam / ReCamMaster

hanyang-21 / VideoScene

datawhalechina / llm-cookbook

YinHan-Zhang / MagicColor

KovenYu / WonderWorld

nv-tlabs / stmc

milkymap / transformer-image-captioning

zarzouram / image_captioning_with_transformers

OpenMotionLab / MotionGPT

EricGuo5513 / momask-codes

HKUDS / LightRAG

YaoFANGUK / video-subtitle-remover

RVC-Boss / GPT-SoVITS

Sanster / IOPaint

antgroup / echomimic_v2