Skip to content
View QZH-777's full-sized avatar
  • Tsinghua University
  • Tsinghua University

Block or report QZH-777

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

This repository hosts a collection of datasets for training and evaluating CUA / GUI agents.

91 7 Updated Jul 27, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 21,445 3,413 Updated Jan 5, 2026
Dockerfile 36 12 Updated May 15, 2025

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 1,965 164 Updated Dec 16, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,721 386 Updated Dec 23, 2025

**DeepL免秘钥,免启服务**,双击使用,免费无限次使用,(**新增DeepL单词查询功能**)根据网页版JavaScript加密算法逆向开发的bobplugin;所以只要官网的算法不改,理论上就可以无限使用;(重大更新!!!回馈老用户,现已优化,频繁访问后仍然可以继续免费翻译!!) **apiKey is not required,No account password required**

JavaScript 611 41 Updated Aug 30, 2024

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,237 122 Updated Nov 9, 2025

《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀

Shell 59,904 12,312 Updated Nov 7, 2025

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 75,164 8,973 Updated Jan 7, 2026
Python 24 20 Updated Oct 12, 2025

Latest Advances on System-2 Reasoning

Python 1,302 75 Updated Jun 8, 2025

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

856 35 Updated Dec 4, 2025

Official Repo for Open-Reasoner-Zero

Python 2,086 118 Updated Jun 2, 2025

This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.

Jupyter Notebook 451 83 Updated Feb 13, 2024

Reproduce R1 Zero on Logic Puzzle

Python 2,430 165 Updated Mar 20, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,755 848 Updated Jan 8, 2026

Simple RL training for reasoning

Python 3,822 283 Updated Dec 23, 2025
3 Updated Jan 24, 2025

Machine-generated text detection in the wild (ACL 2024)

Python 224 12 Updated Mar 6, 2025

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,364 507 Updated Jul 10, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,071 1,116 Updated Jan 10, 2026

Scalable RL solution for advanced reasoning of language models

Python 1,794 101 Updated Mar 18, 2025
Python 552 66 Updated Jan 2, 2025
JavaScript 86 9 Updated Dec 11, 2025

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 492 31 Updated Jun 6, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

72,712 8,353 Updated Dec 22, 2025

Towards Large Multimodal Models as Visual Foundation Agents

Python 248 9 Updated Apr 24, 2025
Python 281 20 Updated Aug 18, 2025

An LLM-based Web Navigating Agent (KDD'24)

Python 918 83 Updated Sep 27, 2024

Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.

229 16 Updated May 28, 2025
Next