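**DeepL with no API key and no local service to run.** Double-click to use; free, with unlimited use (**new: DeepL word lookup**). A bobplugin reverse-engineered from the web version's JavaScript encryption algorithm, so as long as the official site's algorithm does not change, it can in theory be used indefinitely. (Major update: as a thank-you to existing users, it has been optimized so that free translation keeps working even after frequent requests.) **apiKey is not required, No account password required**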

JavaScript 611 41 Updated Aug 30, 2024

TsinghuaC3I / Awesome-RL-for-LRMs

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,237 122 Updated Nov 9, 2025

youngyangyang04 / leetcode-master

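《代码随想录》 (Code Thoughts) LeetCode practice guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, 50+ mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer bewildering! 🔥🔥 Take a look; you will wish you had found it sooner! 🚀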

Shell 59,904 12,312 Updated Nov 7, 2025

browser-use / browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

Python 75,164 8,973 Updated Jan 7, 2026

xuw / llm_course_public

Python 24 20 Updated Oct 12, 2025

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python 1,302 75 Updated Jun 8, 2025

yfzhang114 / Awesome-Multimodal-Large-Language-Models

Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models

856 35 Updated Dec 4, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,086 118 Updated Jun 2, 2025

ray-project / ray-educational-materials

A suite of hands-on training materials that show how to scale CV, NLP, and time-series forecasting workloads with Ray.

Jupyter Notebook 451 83 Updated Feb 13, 2024

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,430 165 Updated Mar 20, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 8,755 848 Updated Jan 8, 2026

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,822 283 Updated Dec 23, 2025

QZH-777 / longrag

3 Updated Jan 24, 2025

yafuly / MAGE

Machine-generated text detection in the wild (ACL 2024)

Python 224 12 Updated Mar 6, 2025

LyWangPX / Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Jupyter Notebook 2,364 507 Updated Jul 10, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,071 1,116 Updated Jan 10, 2026