Lists (12)
Sort Name ascending (A-Z)
Stars
A debugging and profiling tool that can trace and visualize python code execution
Official Problem Sets / Reference Kernels for the GPU MODE Leaderboard!
We want to compare how good Qwen3-1.7B-Base using B200 to continue pretraining on Malaysian multi-lingual corpus on different mixed precision training with proper truncated multi-packing.
Super basic implementation (gist-like) of RLMs with REPL environments.
Triton-based Symmetric Memory operators and examples
From babyGPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).
This is a beginner-friendly tutorial on MLIR from the perspective of a user of MLIR, not a compiler engineer. This tutorial will introduce why MLIR exists and how it is used to compile code at diff…
Use Claude Code with any LLM provider - GLM-4.5, Kimi-K2, Qwen3-Coder, DeepSeek, etc.
A non-saturating, open-ended environment for evaluating LLMs in Factorio
Renderer for the harmony response format to be used with gpt-oss
Hierarchical Reasoning Model Official Release
Official Repository of Absolute Zero Reasoner
Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse environments
A repository to unravel the language of GPUs, making their kernel conversations easy to understand
Collection of kernels written in Triton language
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling
🕷️ An undetectable, powerful, flexible, high-performance Python library to make Web Scraping Easy and Effortless as it should be!
Pocket Flow: 100-line LLM framework. Let Agents build Agents!