-
slime Public
Forked from THUDM/slimeslime is an LLM post-training framework for RL Scaling.
Python Apache License 2.0 UpdatedOct 24, 2025 -
flash-attention Public
Forked from Dao-AILab/flash-attentionFast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 18, 2025 -
modded-nanogpt Public
Forked from KellerJordan/modded-nanogptNanoGPT (124M) in 3 minutes
Python MIT License UpdatedSep 17, 2025 -
vllm-2015aroras Public
Forked from 2015aroras/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedSep 2, 2025 -
tyler-romero.github.io Public
Technical Blog + Personal Website
-
-
transformers Public
Forked from 2015aroras/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
LLMPlaysPokemon Public
Forked from davidhershey/ClaudePlaysPokemonStarterPython UpdatedJun 9, 2025 -
OLMo-core Public
Forked from allenai/OLMo-corePyTorch building blocks for the OLMo ecosystem
Python Apache License 2.0 UpdatedJun 8, 2025 -
HeavyBall Public
Forked from HomebrewML/HeavyBallEfficient optimizers
Python BSD 2-Clause "Simplified" License UpdatedMay 18, 2025 -
nanogpt-speedrun Public
NanoGPT (124M) as fast as possible
-
open-instruct Public
Forked from allenai/open-instructAllenAI's post-training codebase
Python Apache License 2.0 UpdatedMar 11, 2025 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedFeb 7, 2025 -
verl Public
Forked from volcengine/verlveRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 UpdatedFeb 7, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedFeb 6, 2025 -
microR1 Public
Simple repository for training small reasoning models
-
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
-
seahorse Public
A small vision language model meant for research
-
aegae Public
Learning Triton / CUDA
-
vlm-evaluation Public
Forked from TRI-ML/vlm-evaluationVLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning
Python Other UpdatedJul 15, 2024 -
prismatic-vlm Public
Forked from TRI-ML/prismatic-vlmsA flexible and efficient codebase for training visually-conditioned language models (VLMs)
Python MIT License UpdatedJul 4, 2024 -
ESP32-e-Paper-Weather-Display Public
Forked from G6EJD/ESP32-e-Paper-Weather-DisplayAn ESP32 and 2.9", 4.2" or 7.5" ePaper Display reads Weather Underground data via their API and then displays the weather
C Other UpdatedJun 22, 2024 -
LLaVA Public
Forked from haotian-liu/LLaVA[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.
-
YOLOX Public
Forked from Megvii-BaseDetection/YOLOXYOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Python Apache License 2.0 UpdatedAug 30, 2023 -
neel-plotly Public
Forked from neelnanda-io/neel-plotlyA very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorch
Python Apache License 2.0 UpdatedJun 16, 2023 -
-
Ax Public
Forked from facebook/AxAdaptive Experimentation Platform
Jupyter Notebook MIT License UpdatedJul 16, 2022 -
bibtex-js Public
Forked from pcooksey/bibtex-jsBibTeX-js can parse a BibTeX-file and render it as part of an HTML file. This way, you can easily add a list of publications to your private homepage or display a list of recommended publications f…
JavaScript MIT License UpdatedJul 27, 2020 -
frankie-ggp Public
Forked from hardiecate/ggp-baseMCTS for general game playing
Java UpdatedJun 10, 2017 -