Minzheng_Wang MozerWang

Ph.D. Student @ CASIA

37 followers · 87 following

Institute of Automation, Chinese Academy of Sciences
Beijing
https://mozerwang.github.io
@minzheng_wang
https://scholar.google.com/citations?user=glV21ZsAAAAJ
https://www.semanticscholar.org/author/Minzheng-Wang/2264515707

Lists (1)

Social Detection Game

8 repositories

Stars

Shenzhi-Wang / Beyond-the-80-20-Rule-RLVR

The open-source code for the NeurIPS 2025 paper, "Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning."

Python · 26 stars · 1 fork · Updated Nov 14, 2025

Gen-Verse / Open-AgentRL

Demystifying Reinforcement Learning in Agentic Reasoning

Python · 116 stars · 21 forks · Updated Oct 14, 2025

Trae1ounG / Zero_Step_Thinking

Official Code for NeurIPS'25 ER Workshop "The Zero-Step Thinking: An Empirical Study of Mode Selection as a Harder Early Exit Problem in Reasoning Models"

Python · 3 stars · Updated Oct 19, 2025

CLR-Lab / SimKO

SimKO: Simple Pass@K Policy Optimization

Python · 21 stars · 1 fork · Updated Oct 24, 2025

thu-nics / MARS

MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games

Python · 13 stars · Updated Nov 10, 2025

KANABOON1 / MemGen

MemGen: Weaving Generative Latent Memory for Self-Evolving Agents

Python · 180 stars · 15 forks · Updated Nov 1, 2025

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python · 1,979 stars · 154 forks · Updated Nov 17, 2025

PRIME-RL / Entropy-Mechanism-of-RL

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python · 381 stars · 12 forks · Updated Jul 11, 2025

tongjingqi / Awesome-Agent-RL

A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical guides on defining and collecting rewards to build more inte…

49 stars · Updated Sep 1, 2025