Stars
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Training Small Language Model
🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.
LLMs-from-scratch项目中文翻译
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Model parallel transformers in JAX and Haiku
A monospaced programming font inspired by the Minecraft typeface
The DeepJSON benchmark for JSON Output
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Resource list for generating JSON using LLMs via function calling, tools, CFG. Libraries, Models, Notebooks, etc.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Modern C++ Programming Course (C++03/11/14/17/20/23/26)
"Context engineering is the delicate art and science of filling the context window with just the right information for the next step." — Andrej Karpathy. A frontier, first-principles handbook inspi…
SGLang is a fast serving framework for large language models and vision language models.
Reproduction Package for the paper "Type-Constrained Code Generation with Language Models" [PLDI 2025]
A Bulletproof Way to Generate Structured JSON from Language Models
Learning to Generate STRUCTURED Output with Schema Reinforcement Learning
Everything about the SmolLM and SmolVLM family of models
A language agnostic test suite for the JSON Schema specifications
A quick guide (especially) for trending instruction finetuning datasets