- Seoul, Korea
Lists (9)
Sort Name ascending (A-Z)
Stars
Official repository for EXAONE 4.0 built by LG AI Research
Scalable RL solution for advanced reasoning of language models
Recipes to scale inference-time compute of open models
Official repository for EXAONE built by LG AI Research
Official repository for EXAONE 3.5 built by LG AI Research
official implementation of paper "Process Reward Model with Q-value Rankings"
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai
"Improving Mathematical Reasoning with Process Supervision" by OPENAI
A flexible and efficient training framework for large-scale alignment tasks
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Recipes to train reward model for RLHF.
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]
A JAX research toolkit for building, editing, and visualizing neural networks.
Schedule-Free Optimization in PyTorch
terashuf shuffles multi-terabyte text files using limited memory
Ring attention implementation with flash attention
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤
The official PyTorch implementation of Google's Gemma models
An open collection of implementation tips, tricks and resources for training large language models