User-friendly implementation of Mixture-of-Sparse-Attention (MoSA). MoSA selects distinct tokens for each head with expert-choice routing, providing a content-based sparse attention mechanism.

Python 28 4 Updated May 3, 2025

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,316 4,544 Updated Dec 22, 2025

lllyasviel / FramePack

Let's make video diffusion practical!

Python 16,502 1,616 Updated Oct 16, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,108 1,836 Updated Jan 9, 2026

thu-ml / SpargeAttn

[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.

Cuda 895 79 Updated Dec 31, 2025

canopyai / Orpheus-TTS

Towards Human-Sounding Speech

Python 5,867 507 Updated Dec 5, 2025

jerber / lang-jepa

Python 131 13 Updated Dec 23, 2024

eqimp / hogwild_llm

Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache

Python 138 9 Updated Aug 13, 2025

test-time-training / ttt-video-dit

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,342 194 Updated Jun 5, 2025

alexanderswerdlow / unidisc

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 134 5 Updated Apr 2, 2025

adityabingi / Dreamer

Reproduction of DreamerV1 and DreamerV2 in PyTorch for the DeepMind Control Suite

Python 46 12 Updated Dec 27, 2022

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 9,398 1,077 Updated Dec 16, 2025

lamm-mit / SciAgentsDiscovery

Python 577 100 Updated May 10, 2025

Fangyuan Yu fangyuan-ksgk

Stars

Farama-Foundation / Metaworld

LTH14 / JiT

galilai-group / lejepa

TextArena / TextArena

open-compass / VLMEvalKit

sdan / vlm-gym

VsonicV / es-fine-tuning-paper

facebookresearch / MobileLLM-R1

fangyuan-ksgk / abstraction-learning

Simple-Efficient / RL-Factory

marin-community / marin

sapientinc / HRM

ScalingIntelligence / KernelBench

SkyworkAI / SkyReels-V2

Kai-46 / minFM

Multiverse4FM / Multiverse

helblazer811 / Diffusion-Explorer

piotrpiekos / MoSA