Skip to content
View xiaohu2015's full-sized avatar
🏠
Working from home
🏠
Working from home
  • HUST
  • China

Block or report xiaohu2015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".

Python 611 45 Updated Oct 22, 2025

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

Python 159 8 Updated Jul 10, 2025

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,728 81 Updated Sep 8, 2025

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 398 5 Updated Aug 8, 2025

Rectified Flow Inversion (RF-Inversion) - ICLR 2025

Python 460 18 Updated Mar 19, 2025

Scalable and memory-optimized training of diffusion models

Python 1,297 139 Updated Jun 4, 2025

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,116 308 Updated Dec 21, 2024

CogView4, CogView3-Plus and CogView3(ECCV 2024)

Python 1,091 79 Updated Mar 29, 2025

Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package.

1,716 179 Updated Nov 25, 2024
Python 2,220 159 Updated Nov 8, 2024

A general fine-tuning kit geared toward diffusion models.

Python 2,598 253 Updated Nov 11, 2025

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 12,068 1,071 Updated Oct 29, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,116 1,211 Updated Nov 4, 2025

A powerful anti-burn allowing much higher CFG scales for latent diffusion models (for ComfyUI)

Python 215 12 Updated Oct 25, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 37,205 6,233 Updated Jul 26, 2024

Bring portraits to life!

Python 17,276 1,783 Updated Jun 14, 2025

"Probabilistic Machine Learning" - a book series by Kevin Murphy

Jupyter Notebook 5,405 627 Updated Apr 18, 2025

Understand Human Behavior to Align True Needs

Python 4,022 389 Updated Aug 13, 2025

Kolors Team

Python 4,571 350 Updated Nov 13, 2024

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

Python 1,620 80 Updated Sep 25, 2024

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,302 116 Updated Sep 30, 2025

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Jupyter Notebook 447 31 Updated Apr 13, 2025
Python 137 10 Updated Jun 16, 2024

⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral)

Python 634 41 Updated Mar 11, 2025

Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars

Jupyter Notebook 387 24 Updated Apr 8, 2025

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,885 89 Updated Aug 15, 2024

EDM2 and Autoguidance -- Official PyTorch implementation

Python 786 50 Updated Dec 9, 2024

Unofficial Implementation of Animate Anyone by Novita AI

Python 783 68 Updated May 31, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,202 1,664 Updated Sep 24, 2025
Next