-
alibaba-inc
- Beijing, China
Highlights
- Pro
Stars
The Cursor for Designers • An Open-Source AI-First Design tool • Visually build, style, and edit your React App with AI
Miles is an enterprise-facing reinforcement learning framework for large-scale MoE post-training and production workloads, forked from and co-evolving with slime.
A construction kit for reinforcement learning environment management.
Spec-driven development (SDD) for AI coding assistants.
ValueCell is a community-driven, multi-agent platform for financial applications.
Intelligent automation and multi-agent orchestration for Claude Code
A Lightweight LLM Inference Performance Simulator
histmeisah / ROLL
Forked from alibaba/ROLLAn Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
A unified architecture deep learning framework designed specifically for ultra-large-scale sparse models.
Build, evaluate and train General Multi-Agent Assistance with ease
Production-ready platform for agentic workflow development.
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Awesome-LLM: a curated list of Large Language Model
End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"
A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and po…
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games
verl: Volcano Engine Reinforcement Learning for LLMs
Super-Efficient RLHF Training of LLMs with Parameter Reallocation