An extremely simple tool for separating vocals from background music. It runs entirely locally through a web interface, needs no internet connection, and uses 2stems/4stems/5stems models.

Python 1,750 207 Updated Nov 26, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 11,142 983 Updated Nov 19, 2025

OpenBMB / AgentCPM-GUI

AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.

Python 1,109 103 Updated Jun 14, 2025

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,500 59 Updated Jun 14, 2025

jefferyZhan / Griffon

Official repo of Griffon series including v1(ECCV 2024), v2(ICCV 2025), G, and R, and also the RL tool Vision-R1.

Python 243 12 Updated Aug 12, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,527 298 Updated Nov 13, 2025

TencentQQGYLab / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 6,222 704 Updated Mar 19, 2025

TideDra / lmm-r1

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 829 54 Updated May 14, 2025

turningpoint-ai / VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Python 617 23 Updated Mar 18, 2025

luo-junyu / Awesome-Agent-Papers

[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges

2,105 62 Updated Nov 7, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,731 7,594 Updated Nov 19, 2025

inclusionAI / AReaL

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 3,031 235 Updated Nov 19, 2025

xlang-ai / aguvis

[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Python 370 26 Updated Mar 7, 2025

0russwest0 / Agent-R1

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Python 899 57 Updated Nov 19, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Jupyter Notebook 2,401 186 Updated Nov 18, 2025

modelscope / awesome-deep-reasoning

Collect every awesome work about r1!

Python 421 15 Updated May 2, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,420 816 Updated Nov 9, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,467 3,988 Updated Nov 19, 2025

ModalMinds / MM-EUREKA

MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning

Python 761 29 Updated Sep 7, 2025

zhongpu JepsonWong

Starred repositories

OSU-NLP-Group / GUI-Agents-Paper-List

InternLM / xtuner

alibaba / ROLL

X-PLUG / MobileAgent

showlab / Awesome-GUI-Agent

THUDM / slime

ByteDance-Seed / VeOmni

tmgthb / Autonomous-Agents

LiuZH-19 / SongGen

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

tencent-ailab / SongGeneration

jianchang512 / vocal-separate