bradhilton

Brad Hilton bradhilton

Reinforcement Learning Research Engineer

130 followers · 8 following

Ender Research Corp

Achievements

x2 x4 x3

Achievements

x2 x4 x3

Organizations

Stars

meta-pytorch / torchforge

PyTorch-native post-training at scale

Python 549 67 Updated Nov 27, 2025

shangshang-wang / Tora

Forked from meta-pytorch/torchtune

Tora: Torchtune-LoRA for RL

Python 71 7 Updated Nov 12, 2025

thinking-machines-lab / batch_invariant_ops

Python 914 71 Updated Nov 4, 2025

muellerzr / nbdistributed

Seemless interface of using PyTOrch distributed with Jupyter notebooks

Jupyter Notebook 56 13 Updated Sep 15, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,943 625 Updated Nov 27, 2025

diffusionstudio / webcodecs-scroll-sync

TypeScript 101 8 Updated Aug 16, 2025

ButteredFire / Astrocelerate

C++/Vulkan simulation engine

C++ 273 15 Updated Nov 5, 2025

charmbracelet / crush

The glamourous AI coding agent for your favourite terminal 💘

Go 15,406 886 Updated Nov 29, 2025

OpenPipe / art-star-count

Display ART repository star count on a tablet

HTML 1 Updated Jul 14, 2025

OpenPipe / art-langgraph

Python 5 Updated Jul 18, 2025

haykgrigo3 / TimeCapsuleLLM

A LLM trained only on data from certain time periods to reduce modern bias

Python 634 24 Updated Nov 16, 2025

unslothai / unsloth-zoo

Utils for Unsloth https://github.com/unslothai/unsloth

Python 177 172 Updated Nov 30, 2025

tokenbender / avataRL

rl from zero pretrain, can it be done? yes.

Python 281 21 Updated Sep 28, 2025

Maciek-roboblog / Claude-Code-Usage-Monitor

Real-time Claude Code usage monitor with predictions and warnings

Python 5,789 278 Updated Sep 14, 2025

dagger / container-use

Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack.

Go 3,287 174 Updated Nov 24, 2025

modal-labs / modal-client

Python client library for Modal

Python 417 72 Updated Nov 28, 2025

SWE-bench / SWE-bench

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 3,878 700 Updated Nov 15, 2025

SWE-agent / SWE-ReX

Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.

Python 378 89 Updated Nov 24, 2025

SWE-bench / SWE-smith

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 467 82 Updated Nov 24, 2025

lechmazur / elimination_game

A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other

293 10 Updated Aug 14, 2025