- Zhejiang Normal University
- Zhejiang, China
- https://ghh1125.github.io
Stars
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
awesome synthetic (text) datasets
A reading list on LLM-based Synthetic Data Generation 🔥
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
A collection of papers on World Models for Autonomous Driving (and Robotics, etc.).
An introduction to AWESOME_ENTROPY+LRM_PAPERS
Awesome Large Reasoning Model (LRM) Safety. This repository collects security-related research on currently popular large reasoning models such as DeepSeek-R1 and OpenAI o1.
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains papers, code, datasets, evaluations, and analyses.
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".
verl: Volcano Engine Reinforcement Learning for LLMs
An Open-source RL System from ByteDance Seed and Tsinghua AIR
🌐 Permanent Hosting Site: http://ai-paper-finder.info/ 🌐 Hugging Face Hosting: https://huggingface.co/spaces/wenhanacademia/ai-paper-finder
AgentScope: Agent-Oriented Programming for Building LLM Applications
Continuously updated paper list on advancements in Data Agents. Companion repo to our paper "A Survey of Data Agents: Emerging Paradigm or Overstated Hype?"
Official Repo of "RobustFlow: Towards Robust Agentic Workflow Generation"
[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
A Survey of Reinforcement Learning for Large Reasoning Models
Official Repo of "Code2MCP: Transforming Code Repositories into MCP Services", Scaling Environments for Agents Workshop @ NeurIPS 2025
12 Lessons to Get Started Building AI Agents
RepoMaster: The open-source AI agent that masters GitHub. It turns any code repository into a powerful tool, achieving a new level of autonomous task-solving. An open alternative to Claude-Code.
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
(NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning
The official repository for "Rongsheng Wang's Arxiv Template"
Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools