CS + Fin @ Renmin Univ. of China.
Currently interning @mll-lab-nu.
Pinned
- mll-lab-nu/RAGEN (Public): RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
- RUCAIBox/Slow_Thinking_with_LLMs (Public): A series of technical reports on slow thinking with LLMs.
- RUCAIBox/LLMBox (Public): A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.