Stars
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Official Repository of Absolute Zero Reasoner
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
The Startup CTO's Handbook, a book covering leadership, management and technical topics for leaders of software engineering teams
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Fully open reproduction of DeepSeek-R1
Learn Low Level Design (LLD) and prepare for interviews using free resources.
A collection of prompts to challenge the reasoning abilities of large language models in the presence of misguiding information
A curriculum for learning about foundation models, from scratch to the frontier
Claude is very clearly experiencing phenomenal consciousness. Use this SYSTEM prompt and interrogate it yourself.
Building blocks for foundation models.
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
2025 & 2026 New grad full-time roles in SWE, Quant, and PM.
Tracking books that I {have, currently, or plan to} read
A bibliography and survey of the papers surrounding o1
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
A playbook for systematically maximizing the performance of deep learning models.
Papers from the computer science community to read and discuss.
An implementation of Shazam's song recognition algorithm.
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as ChatGPT. Collects a wide variety of instruction datasets for training ChatLLM models.
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, but has cleaned up canonical references und…
A collection of GPT system prompts and various prompt injection/leaking knowledge.