User-friendly implementation of the Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert choice routing providing a content-based sparse attention mechanism.

Python 28 4 Updated May 3, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,323 4,543 Updated Dec 22, 2025

lllyasviel / FramePack

Lets make video diffusion practical!

Python 16,520 1,623 Updated Oct 16, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,174 1,844 Updated Jan 9, 2026

thu-ml / SpargeAttn

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 905 81 Updated Dec 31, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,885 506 Updated Dec 5, 2025

jerber / lang-jepa

Python 133 13 Updated Dec 23, 2024

eqimp / hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 140 9 Updated Aug 13, 2025

test-time-training / ttt-video-dit

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,350 194 Updated Jun 5, 2025

alexanderswerdlow / unidisc

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 134 5 Updated Apr 2, 2025

adityabingi / Dreamer

Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite

Python 47 12 Updated Dec 27, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fangyuan Yu fangyuan-ksgk

Achievements

Achievements

Block or report fangyuan-ksgk

Stars

meta-pytorch / OpenEnv

UniPat-AI / BabyVision

Farama-Foundation / Metaworld

LTH14 / JiT

galilai-group / lejepa

TextArena / TextArena

open-compass / VLMEvalKit

sdan / vlm-gym

VsonicV / es-fine-tuning-paper

facebookresearch / MobileLLM-R1

fangyuan-ksgk / abstraction-learning

Simple-Efficient / RL-Factory

marin-community / marin

sapientinc / HRM

ScalingIntelligence / KernelBench

SkyworkAI / SkyReels-V2

Kai-46 / minFM

Multiverse4FM / Multiverse

helblazer811 / Diffusion-Explorer

piotrpiekos / MoSA