YuanDaoze

Follow

Tao Xiong YuanDaoze

Follow

0 followers · 4 following

Zhejiang University

Achievements

Achievements

Highlights

Pro

Stars

wyf3 / llm_related

复现大模型相关算法及一些学习记录

Python 2,581 353 Updated Nov 15, 2025

modelscope / AgentEvolver

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 615 65 Updated Nov 21, 2025

liuyike-xiaomi / guievalkit

Forked from xiaomi-research/guievalkit

GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents

Python 5 Updated Nov 13, 2025

xiaomi-research / guievalkit

GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents

Python 16 5 Updated Sep 22, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,809 9,393 Updated Nov 24, 2025

JiayuJeff / CostBench

The raw CostBench repository under construction

Python 24 Updated Nov 21, 2025

ZhenbinChan / verl

Forked from volcengine/verl

VERL 可视化、PRM、LLM-as-a-Judge

Python 7 Updated Nov 14, 2025

TheNetAdmin / zjuthesis

Zhejiang University Graduation Thesis LaTeX Template

TeX 3,281 688 Updated Sep 8, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 43,509 4,711 Updated Nov 22, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,510 2,634 Updated Nov 24, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,112 308 Updated Nov 15, 2025

X-PLUG / MobileAgent

Mobile-Agent: The Powerful GUI Agent Family

Python 6,334 639 Updated Nov 14, 2025

MIT-MI / MEM1

Python 168 14 Updated Oct 27, 2025

yansikuan / memory-r1

65 1 Updated Sep 10, 2025

liuchen6667 / qwen_grpo_gsm8k

简单易理解的代码，用于在qwen上使用grpo加强数学能力

Python 41 5 Updated May 14, 2025

ishanjmukherjee / gsm8k-grpo

GRPOTrainer for GSM8K

Python 1 Updated Jul 3, 2025

InfiXAI / InfiGUI-G1

Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenecks in GUI agents through efficient, guided exploration.

Python 111 12 Updated Nov 19, 2025

YurunChen / HarmonyGuard

Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization”

Python 25 Updated Oct 9, 2025

THU-KEG / Agentic-Reward-Modeling

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 112 5 Updated Jun 11, 2025

ritzz-ai / GUI-R1

Official implementation of GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 204 15 Updated May 5, 2025

XLearning-SCU / 2025-ICLR-TCR

Pytorch implementation of "Test-time Adaptation for Cross-modal Retrieval with Query Shift".

Python 28 2 Updated Nov 22, 2025

InfiXAI / InfiGUI-R1

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 61 3 Updated May 23, 2025

InfiXAI / InfiGUIAgent

71 3 Updated May 23, 2025

OS-Agent-Survey / OS-Agent-Survey

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

367 17 Updated Aug 16, 2025

horseee / Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python 1,900 146 Updated Jun 17, 2025

OSU-NLP-Group / GUI-Agents-Paper-List

Building a comprehensive and handy list of papers for GUI agents

Python 555 30 Updated Oct 27, 2025

datawhalechina / self-llm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

Jupyter Notebook 26,149 2,633 Updated Nov 23, 2025

XingYu-Zhong / LLMsStudy

收集大语言模型的学习路径和各种最佳实践

313 37 Updated Mar 19, 2024

princeton-nlp / tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,686 578 Updated Jan 16, 2025

ShusenTang / Deep-Learning-with-PyTorch-Chinese

本仓库将PyTorch官方书籍《Deep learning with PyTorch》（基本摘录版）翻译成中文版并给出可运行的相关代码。

Jupyter Notebook 1,238 259 Updated Nov 25, 2020