  • Zhejiang University

Reproductions of large-model-related algorithms, plus some study notes

Python 2,581 353 Updated Nov 15, 2025

AgentEvolver: Towards Efficient Self-Evolving Agent System

Python 615 65 Updated Nov 21, 2025

GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents

Python 5 Updated Nov 13, 2025

GUIEvalKit: Open-source Evaluation Toolkit for GUI Agents

Python 16 5 Updated Sep 22, 2025

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 64,809 9,393 Updated Nov 24, 2025

The raw CostBench repository under construction

Python 24 Updated Nov 21, 2025

VERL visualization, PRM, LLM-as-a-Judge

Python 7 Updated Nov 14, 2025

Zhejiang University Graduation Thesis LaTeX Template

TeX 3,281 688 Updated Sep 8, 2025

Universal memory layer for AI Agents

Python 43,509 4,711 Updated Nov 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,510 2,634 Updated Nov 24, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,112 308 Updated Nov 15, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 6,334 639 Updated Nov 14, 2025
Python 168 14 Updated Oct 27, 2025

Simple, easy-to-understand code for using GRPO to strengthen math ability on Qwen

Python 41 5 Updated May 14, 2025

GRPOTrainer for GSM8K

Python 1 Updated Jul 3, 2025

Official repository for InfiGUI-G1. We introduce Adaptive Exploration Policy Optimization (AEPO) to overcome semantic alignment bottlenecks in GUI agents through efficient, guided exploration.

Python 111 12 Updated Nov 19, 2025

Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Optimization”

Python 25 Updated Oct 9, 2025

[ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 112 5 Updated Jun 11, 2025

Official implementation of GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents

Python 204 15 Updated May 5, 2025

PyTorch implementation of "Test-time Adaptation for Cross-modal Retrieval with Query Shift".

Python 28 2 Updated Nov 22, 2025

Repository for the paper "InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners"

Python 61 3 Updated May 23, 2025

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

367 17 Updated Aug 16, 2025

A curated list for Efficient Large Language Models

Python 1,900 146 Updated Jun 17, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 555 30 Updated Oct 27, 2025

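"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment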

Jupyter Notebook 26,149 2,633 Updated Nov 23, 2025

A collection of learning paths and best practices for large language models

313 37 Updated Mar 19, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,686 578 Updated Jan 16, 2025

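This repository translates the official PyTorch book "Deep Learning with PyTorch" (the Essential Excerpts edition) into Chinese and provides runnable accompanying code.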

Jupyter Notebook 1,238 259 Updated Nov 25, 2020