Lists (32)
Sort Name ascending (A-Z)
attention
awesome-paper-list
compiler
CVPR
daily-paper
debugging
deep-reasoning
diffusion training
english-speak
Eurosys
framework
gpu-programming
image-modesl
inference
kernel
learn
LLM-serving
memory-management
multi-modal
network
NIPS25
OSDI
pipeline-parallelsim
Profiler
RL frameworks
RLHF
simulator
sparse
tools
video_generation_model
world-models
可视化
Starred repositories
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
A Survey of Efficient Attention Methods: Hardware-efficient, Sparse, Compact, and Linear Attention
Some out-of-the-box hooks for pre-commit
An agentic system for autonomously generating explainable and reproducible time-series anomaly detection rules using LLMs.
[EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
[Support 0.49.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…
[NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
PyTorch implementation of One-step Diffusion with Distribution Matching Distillation
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
torchcomms: a modern PyTorch communications API
A pipeline parallel training script for diffusion models.
MrlX: A Multi-Agent Reinforcement Learning Framework
Krea Realtime 14B. An open-source realtime AI video model.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Democratizing AI scientists with ToolUniverse
Collect some World Models for Autonomous Driving (and Robotic) papers.
[SIGGRAPH Asia 2025] WorldExplorer: Towards Generating Fully Navigable 3D Scenes
Implement a reasoning LLM in PyTorch from scratch, step by step
NCCL communication API layer, and transport layer created from first principles.
Post-training with Tinker