Starred repositories
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
Democratizing Reinforcement Learning for LLMs
slime is an LLM post-training framework for RL Scaling.
Bridge Megatron-Core to Hugging Face/Reinforcement Learning
Tools for merging pretrained large language models.
Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
An Open-Source Large-Scale Reinforcement Learning Project for Search Agents
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Pip compatible CodeBLEU metric implementation available for linux/macos/win
⚡️ Express inspired web framework written in Go
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
My learning notes/codes for ML SYS.
SkyRL: A Modular Full-stack RL Library for LLMs
DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation
A blazingly fast JSON serializing & deserializing library
[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning
A project to improve skills of large language models
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
A live stream development of RL tunning for LLM agents
Lightweight coding agent that runs in your terminal
This repository contains the official implementation of Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models
An Open Large Reasoning Model for Real-World Solutions
A PyTorch native platform for training generative AI models
🔥 A minimal training framework for scaling FLA models