CoderBak

Follow

Haoxiang Sun CoderBak

Follow

CS + Fin @ Renmin Univ. of China. Currently interning @mll-lab-nu.

20 followers · 13 following

hxiang-sun.com

Achievements

Achievements

Pinned Loading

mll-lab-nu/RAGEN mll-lab-nu/RAGEN Public

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2.4k 185
RUCAIBox/OlymMATH RUCAIBox/OlymMATH Public

The OlymMATH dataset

Python 20
RUCAIBox/Slow_Thinking_with_LLMs RUCAIBox/Slow_Thinking_with_LLMs Public

A series of technical report on Slow Thinking with LLM

Python 747 41
RUCAIBox/LLMBox RUCAIBox/LLMBox Public

A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.

Python 845 106