Skip to content
View yyht's full-sized avatar

Block or report yyht

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Really Scalable RL Framework to 10k+ CPUs

Python 38 3 Updated Feb 29, 2024

PyTorch-native post-training at scale

Python 586 73 Updated Jan 10, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 178 9 Updated Dec 16, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,249 4,012 Updated Jan 11, 2026
Shell 11 2 Updated Oct 22, 2025

An interface library for RL post training with environments.

Python 984 146 Updated Jan 9, 2026

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,092 113 Updated Jan 9, 2026

(best/better) practices of megatron on veRL and tuning guide

Shell 116 8 Updated Sep 26, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,068 210 Updated Jan 11, 2026
Python 856 45 Updated Sep 15, 2025

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 276 39 Updated Jan 11, 2026

A library for advanced large language model reasoning

Python 2,318 204 Updated Jun 10, 2025

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 216 26 Updated Sep 23, 2025

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 177 18 Updated Jul 6, 2025

Verlog: A Multi-turn RL framework for LLM agents

Python 67 7 Updated Jan 1, 2026

Official repo for IRL-VLA

75 4 Updated Aug 13, 2025

OpenCUA: Open Foundations for Computer-Use Agents

Python 633 77 Updated Jan 9, 2026

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Python 47 6 Updated Jan 8, 2026
Python 330 25 Updated Aug 29, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 532 34 Updated Nov 26, 2025

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 520 44 Updated Sep 8, 2025
Python 83 3 Updated Nov 17, 2025
Jupyter Notebook 71 2 Updated Aug 6, 2025

Toolchain built around the Megatron-LM for Distributed Training

Python 80 5 Updated Dec 7, 2025

MiroRL is an MCP-first reinforcement learning framework for deep research agent.

Python 212 16 Updated Aug 27, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,901 1,377 Updated Jan 8, 2026

rl from zero pretrain, can it be done? yes.

Python 285 21 Updated Sep 28, 2025

SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution

Python 101 5 Updated Sep 24, 2025

JAxtar is a project with a JAX-native implementation of parallelizeable A* & Q* solver for neural heuristic search research.

Python 42 4 Updated Jan 11, 2026
Next