tyler-romero

Tyler Romero tyler-romero

@allenai pretraining

39 followers · 43 following

slime Public
Forked from THUDM/slime

slime is an LLM post-training framework for RL Scaling.

Python Apache License 2.0 Updated Oct 24, 2025
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python BSD 3-Clause "New" or "Revised" License Updated Sep 18, 2025
modded-nanogpt Public
Forked from KellerJordan/modded-nanogpt

NanoGPT (124M) in 3 minutes

Python MIT License Updated Sep 17, 2025
vllm-2015aroras Public
Forked from 2015aroras/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Sep 2, 2025
tyler-romero.github.io Public

Technical Blog + Personal Website

blog ml multimodal llms

Nunjucks 3 Updated Aug 26, 2025
wedding-website Public

HTML Updated Aug 17, 2025
transformers Public
Forked from 2015aroras/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 1 Apache License 2.0 Updated Jul 14, 2025
LLMPlaysPokemon Public
Forked from davidhershey/ClaudePlaysPokemonStarter

Python Updated Jun 9, 2025
OLMo-core Public
Forked from allenai/OLMo-core

PyTorch building blocks for the OLMo ecosystem

Python Apache License 2.0 Updated Jun 8, 2025
HeavyBall Public
Forked from HomebrewML/HeavyBall

Efficient optimizers

Python BSD 2-Clause "Simplified" License Updated May 18, 2025
nanogpt-speedrun Public

NanoGPT (124M) as fast as possible

Python 14 5 MIT License Updated Apr 15, 2025
open-instruct Public
Forked from allenai/open-instruct

AllenAI's post-training codebase

Python Apache License 2.0 Updated Mar 11, 2025
trl Public
Forked from huggingface/trl

Train transformer language models with reinforcement learning.

Python Apache License 2.0 Updated Feb 7, 2025
verl Public
Forked from volcengine/verl

veRL: Volcano Engine Reinforcement Learning for LLM

Python Apache License 2.0 Updated Feb 7, 2025
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python Apache License 2.0 Updated Feb 6, 2025
microR1 Public

Simple repository for training small reasoning models

reasoning r1 deepseek grpo

Python 44 6 MIT License Updated Feb 6, 2025
Liger-Kernel Public
Forked from linkedin/Liger-Kernel

Efficient Triton Kernels for LLM Training

Python 2 BSD 2-Clause "Simplified" License Updated Jan 16, 2025
seahorse Public

A small vision language model meant for research

vlm vision-language-model

Python 4 MIT License Updated Oct 16, 2024
aegae Public

Learning Triton / CUDA

cuda triton

Jupyter Notebook 2 Apache License 2.0 Updated Sep 6, 2024
vlm-evaluation Public
Forked from TRI-ML/vlm-evaluation

VLM Evaluation: Benchmark for VLMs, spanning text generation tasks from VQA to Captioning

Python Other Updated Jul 15, 2024
prismatic-vlm Public
Forked from TRI-ML/prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python MIT License Updated Jul 4, 2024
ESP32-e-Paper-Weather-Display Public
Forked from G6EJD/ESP32-e-Paper-Weather-Display

An ESP32 and 2.9", 4.2" or 7.5" ePaper Display reads Weather Underground data via their API and then displays the weather

C Other Updated Jun 22, 2024
LLaVA Public
Forked from haotian-liu/LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.

Python 1 Apache License 2.0 Updated Feb 8, 2024
YOLOX Public
Forked from Megvii-BaseDetection/YOLOX

YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

Python Apache License 2.0 Updated Aug 30, 2023
neel-plotly Public
Forked from neelnanda-io/neel-plotly

A very hacky set of functions for getting plotly to do what I want when doing mech interp research, designed to be compatible with PyTorch

Python Apache License 2.0 Updated Jun 16, 2023
LandUseClassification Public

Jupyter Notebook 1 Other Updated Oct 24, 2022
Ax Public
Forked from facebook/Ax

Adaptive Experimentation Platform

Jupyter Notebook MIT License Updated Jul 16, 2022
bibtex-js Public
Forked from pcooksey/bibtex-js

BibTeX-js can parse a BibTeX-file and render it as part of an HTML file. This way, you can easily add a list of publications to your private homepage or display a list of recommended publications f…

JavaScript MIT License Updated Jul 27, 2020
frankie-ggp Public
Forked from hardiecate/ggp-base

MCTS for general game playing

monte-carlo-tree-search general-game-playing

Java Updated Jun 10, 2017
deep-rl-pong Public

A Deep-Q-Learning Agent for Pong

reinforcement-learning deep-q-learning

Python Updated May 1, 2017

Tyler Romero tyler-romero

slime Public

Uh oh!

flash-attention Public

Uh oh!

modded-nanogpt Public

Uh oh!

vllm-2015aroras Public

Uh oh!

tyler-romero.github.io Public

Uh oh!

wedding-website Public

Uh oh!

transformers Public

Uh oh!

LLMPlaysPokemon Public

Uh oh!

OLMo-core Public

Uh oh!

HeavyBall Public

Uh oh!

nanogpt-speedrun Public

Uh oh!

open-instruct Public

Uh oh!

trl Public

Uh oh!

verl Public

Uh oh!

OpenRLHF Public

Uh oh!

microR1 Public

Uh oh!

Liger-Kernel Public

Uh oh!

seahorse Public

Uh oh!

aegae Public

Uh oh!

vlm-evaluation Public

Uh oh!

prismatic-vlm Public

Uh oh!

ESP32-e-Paper-Weather-Display Public

Uh oh!

LLaVA Public

Uh oh!

YOLOX Public

Uh oh!

neel-plotly Public

Uh oh!

LandUseClassification Public

Uh oh!

Ax Public

Uh oh!

bibtex-js Public

Uh oh!

frankie-ggp Public

Uh oh!

deep-rl-pong Public

Uh oh!