-
Harbin Institute of Technology
- China
-
15:46
(UTC +08:00) - https://mhzhou.com/
- https://orcid.org/0000-0003-3250-4978
Stars
MineContext is your proactive context-aware AI partner(Context-Engineering+ChatGPT Pulse)
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
👁️ 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
Wan: Open and Advanced Large-Scale Video Generative Models
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation
Enjoy the magic of Diffusion models!
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
This repository open-sources CreatiPoster, an AI-driven graphic design generation system for multi-layer and editable compositions with strong visual appeal.
Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
Docker build for FFmpeg on Ubuntu / Alpine / Centos / Scratch / nvidia / vaapi
[CVPR 2025] A Large-Scale High-Quality Dataset for Enhancing Human-Centric Video Generation
An official implementation of DanceGRPO: Unleashing GRPO on Visual Generation
Awesome Unified Multimodal Models
An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applications.
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
世界上最好的MCP Servers的列表,The best mcp servers in the world.
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉
Training Large Language Model to Reason in a Continuous Latent Space
No fortress, purely open ground. OpenManus is Coming.
Paper backup generator suitable for long-term storage.
Spotify for the terminal written in Rust 🚀