- My research primarily focuses on Reinforcement Fine-Tuning (RFT) and Vision-Language Models (VLMs). Additionally, I am exploring browser-based and computer-use agent systems. If you're interested in collaboration, feel free to reach out!
- Core Contributor of Skywork-R1V3; for more technical details, please refer to the 📰 tech report
- Main Contributor of MOSS-RLHF; for more technical details, please refer to 📰 Secrets of RLHF Part 1
- Here is my academic page
🎯 Focus on LLM Alignment (RLHF)
- Fudan University
- Shanghai
Pinned
- fakerbaby.github.io (Public)
  GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
  JavaScript
- Skywork-R1V (Public), forked from SkyworkAI/Skywork-R1V
  Skywork-R1V3: Advanced Multimodal Reasoning Model
  Python