MingtaoGuo
🎯
Focusing
  • Sichuan University
  • Chengdu


Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,450 91 Updated Sep 11, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,520 1,285 Updated Oct 12, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,963 327 Updated Nov 7, 2025

An inference and training framework for multiple image input in Flux Kontext dev

Jupyter Notebook 417 29 Updated Sep 1, 2025

[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 1,936 121 Updated Nov 2, 2025

[CAD/Graphics 2025][Computers & Graphics] Navigating Large-Pose Challenge for High-Fidelity Face Reenactment with Video Diffusion Model

Python 5 2 Updated Sep 2, 2025

Code of π^3: Permutation-Equivariant Visual Geometry Learning

Python 1,345 66 Updated Sep 10, 2025

Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" (ICCV 2025)

Python 1,687 128 Updated Jul 25, 2025

[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation

Python 2,651 449 Updated Sep 25, 2025

FACM: Flow-Anchored Consistency Models

Python 127 2 Updated Aug 6, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,570 1,194 Updated Oct 11, 2025

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 22,317 2,321 Updated Apr 29, 2025

Flow Matching (Rectified Flow) hand-built from scratch

Python 528 29 Updated Dec 7, 2024

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,405 3,894 Updated Apr 19, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,016 112 Updated Oct 29, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,401 241 Updated Oct 17, 2025

[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models

Python 303 9 Updated Apr 24, 2025

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,598 76 Updated Oct 23, 2025

Let's make video diffusion practical!

Python 16,134 1,550 Updated Oct 16, 2025

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Python 809 55 Updated Apr 27, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,301 107 Updated Mar 11, 2025

PyTorch implementation of "Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model" (SIGGRAPH 2025)

Python 209 23 Updated Jul 14, 2024

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

1,362 75 Updated Nov 6, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 14,657 2,115 Updated Jul 17, 2025

Enjoy the magic of Diffusion models!

Python 10,609 990 Updated Nov 7, 2025

[CVPR 2025] High-Fidelity Relightable Monocular Portrait Animation with Lighting-Controllable Video Diffusion Model

Python 56 5 Updated Jun 4, 2025

Pippo: High-Resolution Multi-View Humans from a Single Image

Python 614 47 Updated Apr 4, 2025

Facial Expression Analysis Toolbox

Python 335 90 Updated Jan 12, 2025

Official PyTorch implementation of the paper "High-quality Animatable Eyelid Shapes from Lightweight Captures" (SIGGRAPH Asia 2024).

Python 38 3 Updated Dec 11, 2024