SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 216 26 Updated Sep 23, 2025

OPPO-PersonalAI / TaskCraft

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 177 18 Updated Jul 6, 2025

WentseChen / Verlog

Forked from volcengine/verl

Verlog: A Multi-turn RL framework for LLM agents

Python 67 7 Updated Jan 1, 2026

IRL-VLA / IRL-VLA

Official repo for IRL-VLA

75 4 Updated Aug 13, 2025

xlang-ai / OpenCUA

OpenCUA: Open Foundations for Computer-Use Agents

Python 633 77 Updated Jan 9, 2026

bytedance / FTRL

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Python 47 6 Updated Jan 8, 2026

InternLM / InternBootcamp

Python 330 25 Updated Aug 29, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 532 34 Updated Nov 26, 2025

OPPO-PersonalAI / Agent_Foundation_Models

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 520 44 Updated Sep 8, 2025

DreamLM / Dream-Coder

Python 83 3 Updated Nov 17, 2025

ctlllll / gpt-oss-reverse-engineering

Jupyter Notebook 71 2 Updated Aug 6, 2025

OpenSQZ / MegatronApp

Toolchain built around Megatron-LM for distributed training

Python 80 5 Updated Dec 7, 2025

MiroMindAI / MiroRL

MiroRL is an MCP-first reinforcement learning framework for deep research agents.

Python 212 16 Updated Aug 27, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,901 1,377 Updated Jan 8, 2026

tokenbender / avataRL

rl from zero pretrain, can it be done? yes.

Python 285 21 Updated Sep 28, 2025

zhenyuhe00 / SWE-Swiss

SWE-Swiss: A Multi-Task Fine-Tuning and RL Recipe for High-Performance Issue Resolution

Python 101 5 Updated Sep 24, 2025

tinker495 / JAxtar

JAxtar is a JAX-native implementation of parallelizable A* and Q* solvers for neural heuristic search research.

Python 42 4 Updated Jan 11, 2026

yyht

Stars

ByteDance-Seed / Seed-Prover

openpsi-project / srl

meta-pytorch / torchforge

THUDM / AgentRL

sgl-project / sglang

InfiXAI / InfiR2

meta-pytorch / OpenEnv

inclusionAI / AWorld

ISEEKYAN / verl_megatron_practice

RLinf / RLinf

ByteDance-Seed / seed-oss

nvidia-cosmos / cosmos-rl

maitrix-org / llm-reasoners

JARVIS-Xs / SE-Agent