Highlights
- Pro
Starred repositories
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
Scalable toolkit for efficient model reinforcement
Official repository of the NeurIPS 2025 Competition: The PokeAgent Challenge: Competitive and Long-Context Learning at Scale. (Track 2, Speedrunning)
[ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking
Code for the paper "Learning a Diffusion Model Policy from Rewards via Q-Score Matching"
FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
Interactive Markov-chain Monte Carlo Javascript demos
An interface library for RL post training with environments.
A Multi-Task Dataset for Simulated Humanoid Control
Post-training with Tinker
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
Suite of motion imitation methods for training controllers.
๐ A modern watch command. Time machine and pager etc.
ArcticInference: vLLM plugin for high-throughput, low-latency inference
A local-first LaTeX & Typst web editor with real-time collaboration & offline support
Catch MCP server issues before your agents do.
Platform for evaluating reinforcement learning (RL) algorithms on a physical Atari system.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
A framework for building, orchestrating and deploying AI agents and multi-agent workflows with support for Python and .NET.
JAX implementation of WSRL and RL baselines | ICLR 2025
A lightweight, local-first, and ๐ experiment tracking library from Hugging Face ๐ค
Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion models are significantly more data-efficient than standard leftโฆ
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
LEAKED SYSTEM PROMPTS FOR CHATGPT, GEMINI, GROK, CLAUDE, PERPLEXITY, CURSOR, DEVIN, REPLIT, AND MORE! - AI SYSTEMS TRANSPARENCY FOR ALL! ๐