Skip to content
View wduo's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report wduo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An Application Framework for AI Engineering

Java 7,643 2,200 Updated Jan 12, 2026

Agentic AI Framework for Java Developers

Java 7,861 1,702 Updated Jan 12, 2026

A construction kit for reinforcement learning environment management.

Python 297 30 Updated Jan 12, 2026

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,622 198 Updated Jan 12, 2026

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 73 3 Updated Jul 18, 2025

RewardBench: the first evaluation tool for reward models.

Python 678 95 Updated Jun 12, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,793 319 Updated Nov 13, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,062 2,098 Updated May 19, 2025

the official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"

Python 884 39 Updated Oct 26, 2024

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,110 1,119 Updated Jan 12, 2026

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,709 1,507 Updated Jan 4, 2026

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,702 76 Updated May 11, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 26,117 1,837 Updated Jan 9, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,244 3,004 Updated Jan 12, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,289 4,022 Updated Jan 12, 2026

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,323 1,894 Updated Sep 8, 2025
Jupyter Notebook 2,747 365 Updated May 2, 2025

Self-Reflection in LLM Agents: Effects on Problem-Solving Performance

Python 92 10 Updated Nov 25, 2024

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,769 848 Updated Jan 8, 2026

Git extension for versioning large files

Go 13,989 2,186 Updated Jan 3, 2026

Fine-Grained Open Domain Image Animation with Motion Guidance

Python 957 78 Updated Oct 18, 2024

PaddleNLP UIE模型的PyTorch版实现

Python 674 120 Updated Aug 13, 2023

超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题

Python 131 31 Updated Oct 9, 2021

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 639 101 Updated Oct 19, 2021

Awesome papers about generative Information Extraction (IE) using Large Language Models (LLMs)

1,042 61 Updated Nov 18, 2024

Neo4j graph construction from unstructured data using LLMs

Jupyter Notebook 4,263 758 Updated Jan 12, 2026

Modeling, training, eval, and inference code for OLMo

Python 6,281 695 Updated Nov 24, 2025

RoBERTa中文预训练模型: RoBERTa for Chinese

Python 2,765 411 Updated Jul 22, 2024

Open source Python library for converting PDF to DOCX.

Python 3,263 470 Updated May 28, 2025
Next