Skip to content
View YinHan-Zhang's full-sized avatar

Block or report YinHan-Zhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"

Python 267 39 Updated Oct 17, 2025

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生…

39 4 Updated Apr 26, 2025

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 288 13 Updated Nov 5, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,667 327 Updated Jan 21, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,590 197 Updated Nov 16, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,056 52 Updated Mar 5, 2025

The ultimate training toolkit for finetuning diffusion models

Python 6,869 832 Updated Nov 10, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,324 77 Updated Sep 12, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,539 110 Updated Nov 13, 2025

Controlnet module for Wan2.1

Python 25 1 Updated Aug 4, 2025

Scaling Diffusion Transformers with Mixture of Experts

Python 401 19 Updated Sep 9, 2024

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,496 82 Updated Nov 10, 2025

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

Python 614 29 Updated May 15, 2025

[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"

Python 449 24 Updated Aug 4, 2025

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,883 181 Updated Aug 7, 2024

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,608 78 Updated Oct 23, 2025

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 325 9 Updated Jul 4, 2025

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 22,184 2,674 Updated Jun 12, 2025

The official implementation of "MagicColor: Multi-Instance Sketch Colorization"

Python 119 8 Updated Jun 30, 2025

Code release for https://kovenyu.com/WonderWorld/

Python 670 33 Updated Apr 14, 2025

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Python 135 8 Updated Jun 18, 2024

Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING

Python 31 6 Updated Jun 1, 2022

Pytorch implementation of image captioning using transformer-based model.

Jupyter Notebook 68 9 Updated Apr 13, 2023

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,795 130 Updated Jul 1, 2025

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 1,164 95 Updated Sep 13, 2024

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 23,290 3,457 Updated Nov 16, 2025

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,508 1,061 Updated Jun 26, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 52,305 5,726 Updated Sep 10, 2025

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,345 2,326 Updated Apr 29, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,362 511 Updated Aug 11, 2025
Next