Rockey RockeyCoss

PhD Candidate @ ANU | Research Intern @ByteDance-Seed

84 followers · 290 following

the Solar System

Stars

GaParmar / group-inference

Scalable group inference for generating high quality and diverse images with diffusion models.

Python · 38 stars · 1 fork · Updated Aug 31, 2025

LightCooling / flux2-vton-lora

LoRA fine-tuning for FLUX.2 to improve virtual try-on (VTON) capabilities

Python · 2 stars · Updated Dec 9, 2025

gcorso / particle-guidance

Implementation of Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

Python · 76 stars · 3 forks · Updated Oct 23, 2023

alibaba-damo-academy / T2I-Distill

[Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide

Python · 321 stars · 21 forks · Updated Dec 31, 2025

nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

Python · 623 stars · 57 forks · Updated Jan 5, 2026

End2End-Diffusion / iREPA

Official implementation for What matters for Representation Alignment: Global Information or Spatial Structure?

Python · 174 stars · 7 forks · Updated Dec 15, 2025

Alibaba-Quark / LiveAvatar

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python · 1,364 stars · 130 forks · Updated Dec 30, 2025

deepseek-ai / DeepSeek-Math-V2

Python · 1,513 stars · 120 forks · Updated Dec 1, 2025

Tongyi-MAI / Z-Image

Python · 8,812 stars · 524 forks · Updated Jan 7, 2026

bebebe666 / OptimalSteps

Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".

Python · 193 stars · 12 forks · Updated Apr 13, 2025

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python · 1,944 stars · 121 forks · Updated Dec 8, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, and of generating speech in real time.

Jupyter Notebook · 3,869 stars · 310 forks · Updated Jun 12, 2025

BarretBa / ICTHP

Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment [ICCV 2025] - Official implementation

Python · 41 stars · 1 fork · Updated Aug 5, 2025

meituan-longcat / LongCat-Video

Python · 1,872 stars · 262 forks · Updated Dec 20, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python · 1,685 stars · 58 forks · Updated Dec 26, 2025

discus0434 / aesthetic-predictor-v2-5

SigLIP-based Aesthetic Score Predictor

Python · 369 stars · 8 forks · Updated Dec 18, 2024

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python · 6,907 stars · 398 forks · Updated Dec 31, 2025

NVlabs / DiffusionNFT

DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python · 536 stars · 18 forks · Updated Jan 6, 2026

charlie0129 / batt

Control and limit battery charging on Apple Silicon MacBooks.

Go · 1,346 stars · 54 forks · Updated Jan 7, 2026

QwenLM / Qwen3-Omni

Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook · 3,227 stars · 202 forks · Updated Jan 8, 2026

Tencent-Hunyuan / SRPO

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python · 1,237 stars · 40 forks · Updated Oct 26, 2025

IamCreateAI / FlowCPS

Forked from yifan123/flow_grpo

An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching

Python · 57 stars · 3 forks · Updated Sep 11, 2025

Lists (4)

🔮 Future ideas

✨ Inspiration

🚀 My stack

reward_models
