Tsinghua University
Starred repositories
This repository hosts a collection of datasets for training and evaluating CUA / GUI agents.
An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
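**DeepL with no API key and no service to start**: double-click to use, free with unlimited use (**new: DeepL word lookup**). A bobplugin developed by reverse-engineering the JavaScript encryption algorithm of the web version, so as long as the official site's algorithm does not change, it can in theory be used without limit. (Major update: as a thank-you to long-time users, it has been optimized so that free translation keeps working even after frequent requests.) **apiKey is not required, no account password required**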
A Survey of Reinforcement Learning for Large Reasoning Models
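《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video breakdowns of the hard parts, and more than 50 mind maps, with versions in C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer bewildering! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀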
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Latest Advances on System-2 Reasoning
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Official Repo for Open-Reasoner-Zero
This is a suite of hands-on training materials that shows how to scale CV, NLP, and time-series forecasting workloads with Ray.
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
Machine-generated text detection in the wild (ACL 2024)
Solutions of Reinforcement Learning, An Introduction
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Scalable RL solution for advanced reasoning of language models
Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Towards Large Multimodal Models as Visual Foundation Agents
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.