Skip to content
View KenyonY's full-sized avatar
🐾
On vacation
🐾
On vacation
  • Shanghai
  • 19:57 (UTC +08:00)

Organizations

@ml-natural-language-processing

Block or report KenyonY

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

rl

5 repositories

Solve Visual Understanding with Reinforced VLMs

Python 5,702 370 Updated Oct 21, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,087 306 Updated Nov 15, 2025

RLHF experiments on a single A100 40G GPU. Support PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, DeepSeek R1-Zero reproducing.

Python 74 11 Updated Feb 19, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,404 1,523 Updated Apr 24, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,094 2,593 Updated Nov 19, 2025