Stars
Visualizer for neural network, deep learning and machine learning models
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
A bunch of triton kernels with increasing complexity for learning and exploring triton and GPU programming
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Code accompanying the paper "Generalized Interpolating Discrete Diffusion"
Minimalistic large language model 3D-parallelism training
TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Efficient Triton Kernels for LLM Training
PyTorch building blocks for the OLMo ecosystem
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
Democratizing Reinforcement Learning for LLMs
wolfecameron / nanoMoE
Forked from karpathy/nanoGPT. An extension of the nanoGPT repository for training small MOE models.
Stanford Drone Dataset with non-convex Constraints
Sum-of-squares Non-monotonic Probabilistic Circuits
A computer algebra system written in pure Python
Code for "TabZilla: When Do Neural Nets Outperform Boosted Trees on Tabular Data?"
Official implementation of E(n)-equivariant Graph Neural Cellular Automata
A New Modeling Framework for Continuous, Sequential Domains
Code release for Hoogeboom, Emiel, Jorn WT Peters, Rianne van den Berg, and Max Welling. "Integer Discrete Flows and Lossless Compression." Conference on Neural Information Processing Systems (2019).