- My research primarily focuses on Reinforcement Fine-Tuning (RFT) and Vision-Language Models (VLMs). Additionally, I am exploring browser-based and computer-use agent systems. If you're interested in collaboration, feel free to reach out!
- Core Contributor of Skywork-R1V3, more technical details please refers to 📰tech report
- Main Contributor of MOSS-RLHF, more technical details please refers to 📰secrest of RLHF par1
- Here is my academic page
🎯
    Focusing
    Focus on LLM Alignment (RLHF)
- 
                  Fudan University
- shanghai
- 
        
  12:06
  (UTC -12:00) 
Pinned Loading
- 
  
- 
  fakerbaby.github.iofakerbaby.github.io PublicGithub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes JavaScript 
- 
  Skywork-R1VSkywork-R1V PublicForked from SkyworkAI/Skywork-R1V Skywork-R1V3: Advanced Multimodal Reasoning Model Python 
          Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
  If the problem persists, check the GitHub status page or contact support.