qqr is an RL training framework for open-ended agents.

Python 155 11 Updated Jan 16, 2026

Reinforcement Learning Finetunes Small Subnetworks in Large Language Models

Python 10 4 Updated Oct 20, 2025

Spectral Sphere Optimizer

Python 66 1 Updated Jan 14, 2026

A Really Scalable RL Framework to 10k+ CPUs

Python 38 3 Updated Feb 29, 2024

PyTorch-native post-training at scale

Python 597 77 Updated Jan 18, 2026

Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Python 189 10 Updated Jan 17, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,555 4,098 Updated Jan 19, 2026
Shell 11 2 Updated Oct 22, 2025

An interface library for RL post training with environments.

Python 1,061 159 Updated Jan 16, 2026

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,101 113 Updated Jan 19, 2026

(best/better) practices of megatron on veRL and tuning guide

Shell 120 8 Updated Sep 26, 2025

RLinf: Reinforcement Learning Infrastructure for Embodied and Agentic AI

Python 2,181 225 Updated Jan 18, 2026
Python 860 45 Updated Sep 15, 2025

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Python 287 41 Updated Jan 19, 2026

A library for advanced large language model reasoning

Python 2,322 203 Updated Jun 10, 2025

SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 220 28 Updated Sep 23, 2025

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 177 18 Updated Jul 6, 2025

Verlog: A Multi-turn RL framework for LLM agents

Python 67 7 Updated Jan 16, 2026

Official repo for IRL-VLA

74 4 Updated Aug 13, 2025

OpenCUA: Open Foundations for Computer-Use Agents

Python 642 78 Updated Jan 17, 2026

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Python 48 6 Updated Jan 8, 2026
Python 332 25 Updated Aug 29, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 538 34 Updated Nov 26, 2025

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 524 44 Updated Sep 8, 2025
Python 84 3 Updated Nov 17, 2025
Jupyter Notebook 71 2 Updated Aug 6, 2025

Toolchain built around Megatron-LM for distributed training

Python 82 5 Updated Dec 7, 2025

MiroRL is an MCP-first reinforcement learning framework for deep research agents.

Python 220 18 Updated Aug 27, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,975 1,380 Updated Jan 12, 2026