SE-Agent is a self-evolution framework for LLM Code agents. It enables trajectory-level evolution to exchange information across reasoning paths via Revision, Recombination, and Refinement, expandi…

Python 220 28 Updated Sep 23, 2025

OPPO-PersonalAI / TaskCraft

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 177 18 Updated Jul 6, 2025

WentseChen / Verlog

Forked from volcengine/verl

Verlog: A Multi-turn RL framework for LLM agents

Python 67 7 Updated Jan 16, 2026

IRL-VLA / IRL-VLA

Official repo for IRL-VLA

74 4 Updated Aug 13, 2025

xlang-ai / OpenCUA

OpenCUA: Open Foundations for Computer-Use Agents

Python 642 78 Updated Jan 17, 2026

bytedance / FTRL

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Python 48 6 Updated Jan 8, 2026

InternLM / InternBootcamp

Python 332 25 Updated Aug 29, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 538 34 Updated Nov 26, 2025

OPPO-PersonalAI / Agent_Foundation_Models

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 524 44 Updated Sep 8, 2025

DreamLM / Dream-Coder

Python 84 3 Updated Nov 17, 2025

ctlllll / gpt-oss-reverse-engineering

Jupyter Notebook 71 2 Updated Aug 6, 2025

OpenSQZ / MegatronApp

Toolchain built around the Megatron-LM for Distributed Training

Python 82 5 Updated Dec 7, 2025

MiroMindAI / MiroRL

MiroRL is an MCP-first reinforcement learning framework for deep research agent.

Python 220 18 Updated Aug 27, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,975 1,380 Updated Jan 12, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yyht

Achievements

Achievements

Block or report yyht

Stars

Alibaba-NLP / qqr

SagnikMukherjee / sparsity_in_rl

Unakar / Spectral-Sphere-Optimizer

ByteDance-Seed / Seed-Prover

openpsi-project / srl

meta-pytorch / torchforge

THUDM / AgentRL

sgl-project / sglang

InfiXAI / InfiR2

meta-pytorch / OpenEnv

inclusionAI / AWorld

ISEEKYAN / verl_megatron_practice

RLinf / RLinf

ByteDance-Seed / seed-oss

nvidia-cosmos / cosmos-rl

maitrix-org / llm-reasoners

JARVIS-Xs / SE-Agent