wduo

Follow

🎯

Focusing

Duo Wang wduo

🎯

Focusing

Follow

A wandering machine learning researcher, bouncing between groups. I want to understand things clearly, and explain them well. - Colah

24 followers · 37 following

Pretending in Hangzhou Creative Culture Company(PH3C)
Beijing(wangduo.cnblogs.com)
zhihu.com/people/wangduo2014

Achievements

Achievements

Stars

spring-projects / spring-ai

An Application Framework for AI Engineering

Java 7,662 2,205 Updated Jan 13, 2026

alibaba / spring-ai-alibaba

Agentic AI Framework for Java Developers

Java 7,887 1,712 Updated Jan 13, 2026

alibaba / ROCK

A construction kit for reinforcement learning environment management.

Python 300 32 Updated Jan 13, 2026

alibaba / ROLL

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,639 203 Updated Jan 13, 2026

THU-KEG / RM-Bench

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 73 3 Updated Jul 18, 2025

allenai / reward-bench

RewardBench: the first evaluation tool for reward models.

Python 678 95 Updated Jun 12, 2025

PeterGriffinJin / Search-R1

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,800 320 Updated Nov 13, 2025

HqWu-HITCS / Awesome-Chinese-LLM

整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。

22,078 2,098 Updated May 19, 2025

tangqiaoyu / ToolAlpaca

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 884 39 Updated Oct 26, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,140 1,127 Updated Jan 13, 2026

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,746 1,511 Updated Jan 4, 2026

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,704 76 Updated May 11, 2025

deepseek-ai / DeepSeek-R1

91,699 11,780 Updated Jun 27, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,132 1,839 Updated Jan 9, 2026

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,295 3,017 Updated Jan 13, 2026

sgl-project / sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,384 4,034 Updated Jan 13, 2026

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,333 1,894 Updated Sep 8, 2025

mshumer / OpenDeepResearcher

Jupyter Notebook 2,748 365 Updated May 2, 2025

matthewrenze / self-reflection

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Python 92 10 Updated Nov 25, 2024

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,776 848 Updated Jan 8, 2026

git-lfs / git-lfs

Git extension for versioning large files

Go 13,995 2,187 Updated Jan 3, 2026

alibaba / animate-anything

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 957 78 Updated Oct 18, 2024

HUSTAI / uie_pytorch

PaddleNLP UIE模型的PyTorch版实现

Python 675 120 Updated Aug 13, 2023

qingyujean / document-level-classification

超长文本分类（大于1000字）；文档级/篇章级文本分类；主要是解决长距离依赖问题

Python 131 31 Updated Oct 9, 2021

xuyige / BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 640 101 Updated Oct 19, 2021

quqxui / Awesome-LLM4IE-Papers

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

1,043 62 Updated Nov 18, 2024

neo4j-labs / llm-graph-builder

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 4,266 758 Updated Jan 13, 2026

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,283 695 Updated Nov 24, 2025

brightmart / roberta_zh

RoBERTa中文预训练模型: RoBERTa for Chinese

Python 2,765 411 Updated Jul 22, 2024

ArtifexSoftware / pdf2docx

Open source Python library for converting PDF to DOCX.

Python 3,266 471 Updated May 28, 2025