- 
                  Nanjing University
- Nanjing
- https://czczup.github.io/
Highlights
- Pro
Stars
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"
The proposed simulated dataset consisting of 9,536 charts and associated data annotations in CSV format.
Official Repo for Open-Reasoner-Zero
学习笔记 - 码云:https://gitee.com/wanzheng_96/Modules-Learn)
Democratizing Reinforcement Learning for LLMs
A light-weight tool for evaluating LLMs in rule-based ways.
verl: Volcano Engine Reinforcement Learning for LLMs
A curated list of Multi-Modal Reinforcement Learning resources (continually updated)
SGLang is a fast serving framework for large language models and vision language models.
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Official PyTorch implementation for "Large Language Diffusion Models"
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
[🏆AAAI2025] Official Repo for ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area.
GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …