Skip to content
View bradhilton's full-sized avatar
  • Ender Research Corp

Organizations

@Skyvive @Zewo

Block or report bradhilton

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch-native post-training at scale

Python 549 67 Updated Nov 27, 2025

Tora: Torchtune-LoRA for RL

Python 71 7 Updated Nov 12, 2025

Seemless interface of using PyTOrch distributed with Jupyter notebooks

Jupyter Notebook 56 13 Updated Sep 15, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,943 625 Updated Nov 27, 2025

C++/Vulkan simulation engine

C++ 273 15 Updated Nov 5, 2025

The glamourous AI coding agent for your favourite terminal 💘

Go 15,406 886 Updated Nov 29, 2025

Display ART repository star count on a tablet

HTML 1 Updated Jul 14, 2025
Python 5 Updated Jul 18, 2025

A LLM trained only on data from certain time periods to reduce modern bias

Python 634 24 Updated Nov 16, 2025

Utils for Unsloth https://github.com/unslothai/unsloth

Python 177 172 Updated Nov 30, 2025

rl from zero pretrain, can it be done? yes.

Python 281 21 Updated Sep 28, 2025

Real-time Claude Code usage monitor with predictions and warnings

Python 5,789 278 Updated Sep 14, 2025

Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack.

Go 3,287 174 Updated Nov 24, 2025

Python client library for Modal

Python 417 72 Updated Nov 28, 2025

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 3,878 700 Updated Nov 15, 2025

Sandboxed code execution for AI agents, locally or on the cloud. Massively parallel, easy to extend. Powering SWE-agent and more.

Python 378 89 Updated Nov 24, 2025

[NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents

Python 467 82 Updated Nov 24, 2025

A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private conversations, form alliances, and vote to eliminate each other

293 10 Updated Aug 14, 2025

Detect and redact PII locally with SOTA performance

Python 85 15 Updated Mar 25, 2025

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 528 26 Updated Mar 27, 2025

Crowdsourcing the search for compute-optimal RLVR

7 Updated Mar 12, 2025
Ruby 164 10 Updated Mar 24, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,917 758 Updated Nov 25, 2025

Prisma Client Python is an auto-generated and fully type-safe database client designed for ease of use

Python 2,114 87 Updated Apr 10, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,239 102 Updated Nov 13, 2025

[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)

Python 427 36 Updated Oct 23, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,735 272 Updated Jul 18, 2025
Next