Lists (2)
Sort Name ascending (A-Z)
Stars
Fast, small, and fully autonomous AI assistant infrastructure — deploy anywhere, swap anything 🦀
Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.
SGLang is a high-performance serving framework for large language models and multimodal models.
Render After Effects animations natively on Web, Android and iOS, and React Native. http://airbnb.io/lottie/
[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
A framework for few-shot evaluation of language models.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
FastAPI framework, high performance, easy to learn, fast to code, ready for production
Example code for Fluent Python, 2nd edition (O'Reilly 2022)
Understanding Deep Learning - Simon J.D. Prince
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
Fast and memory-efficient exact attention
You like pytorch? You like micrograd? You love tinygrad! ❤️
Add-on agent to generate and expose cluster-level metrics.
Solve puzzles. Improve your pytorch.
Efficient Triton Kernels for LLM Training
Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
A comprehensive catalog of modern and classic books on C++ programming language
My own templates and implementation of important algorithms and data structures for competitive programming
Empowering everyone to build reliable and efficient software.