RockeyCoss
  • the Solar System

Scalable group inference for generating high-quality, diverse images with diffusion models.

Python 38 1 Updated Aug 31, 2025

LoRA fine-tuning for FLUX.2 to improve virtual try-on (VTON) capabilities

Python 2 Updated Dec 9, 2025

Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

Python 76 3 Updated Oct 23, 2023

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python 321 21 Updated Dec 31, 2025

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python 623 57 Updated Jan 5, 2026

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python 174 7 Updated Dec 15, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,364 130 Updated Dec 30, 2025
Python 8,812 524 Updated Jan 7, 2026

Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".

Python 193 12 Updated Apr 13, 2025

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,944 121 Updated Dec 8, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,869 310 Updated Jun 12, 2025

Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation

Python 41 1 Updated Aug 5, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,685 58 Updated Dec 26, 2025

SigLIP-based Aesthetic Score Predictor

Python 369 8 Updated Dec 18, 2024

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,907 398 Updated Dec 31, 2025

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 536 18 Updated Jan 6, 2026

Control and limit battery charging on Apple Silicon MacBooks.

Go 1,346 54 Updated Jan 7, 2026

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,227 202 Updated Jan 8, 2026

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,237 40 Updated Oct 26, 2025

An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching

Python 57 3 Updated Sep 11, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,928 236 Updated Jan 10, 2026

The official UniVerse-1 code.

Python 117 8 Updated Oct 13, 2025

Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Python 196 8 Updated Jan 8, 2026
Python 594 18 Updated Dec 25, 2025

PyTorch implementation of MeanFlow

Jupyter Notebook 288 25 Updated Jul 30, 2025

TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based generation.

Python 841 45 Updated Nov 24, 2025

Enjoy the magic of Diffusion models!

Python 11,400 1,087 Updated Jan 8, 2026

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 251 14 Updated Dec 5, 2025