RyanLiu112

Follow

🎯

Focusing

Runze Liu RyanLiu112

🎯

Focusing

Follow

Master's student @ THU

44 followers · 18 following

Tsinghua University
Qingdao
00:06 (UTC +08:00)
https://ryanliu112.github.io
https://scholar.google.com/citations?user=LiIfGakAAAAJ

Achievements

Achievements

Highlights

Pro

Pinned Loading

TsinghuaC3I/Awesome-RL-for-LRMs TsinghuaC3I/Awesome-RL-for-LRMs Public

A Survey of Reinforcement Learning for Large Reasoning Models

1.9k 106
TsinghuaC3I/MARTI TsinghuaC3I/MARTI Public

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 306 29
compute-optimal-tts compute-optimal-tts Public

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

Python 273 22
Awesome-Process-Reward-Models Awesome-Process-Reward-Models Public

A comprehensive collection of process reward models.

114 2
GenPRM GenPRM Public

Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

Python 84 2
wizard-III/ArcherCodeR wizard-III/ArcherCodeR Public

ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement learning.

Python 42 2