RL
verl: Volcano Engine Reinforcement Learning for LLMs
Agentic RAG R1 Framework via Reinforcement Learning
Train your Agent model via our easy and efficient framework
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning
Agent S: an open agentic framework that uses computers like a human
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
The absolute trainer to light up AI agents.
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
A live stream development of RL tunning for LLM agents
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
FinRL®: Financial Reinforcement Learning. 🔥
Train transformer language models with reinforcement learning.
An Open-Ended Embodied Agent with Large Language Models
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
A toolkit for developing and comparing reinforcement learning algorithms.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Deep Reinforcement Learning Hands-On, 3E_Published by Packt