🌟 100+ original LLM/RL algorithm maps 📚, presented by the author of 《大模型算法》 (Large Model Algorithms)! 💥

Python 1,654 179 Updated Oct 20, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,079 1,102 Updated Sep 26, 2025

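This repository contains the complete research and analysis materials from reverse engineering Claude Code v1.0.33. It includes an in-depth technical analysis of the obfuscated source code, system architecture documentation, and an implementation blueprint for reconstructing the Claude Code agent system. Key findings include the real-time Steering mechanism, the multi-agent architecture, intelligent context management, and the tool execution pipeline. The project serves as a technical reference for understanding the design and implementation of modern AI agent systems.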

JavaScript 10,972 2,888 Updated Jul 19, 2025

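The world's best prompts (prompts with a combined valuation of over 30 billion). Overseas user x1xh successfully obtained the complete official system prompts and internal tools of v0, Manus, Cursor, Same.dev, and Lovable.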

Python 246 44 Updated Apr 15, 2025

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,078 91 Updated Oct 20, 2025

A scalable, end-to-end training pipeline for general-purpose agents

Python 360 54 Updated Jul 4, 2025

Latest Advances on System-2 Reasoning

Python 1,258 72 Updated Jun 8, 2025

Nano vLLM

Python 7,196 926 Updated Aug 31, 2025

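PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons that sort out the algorithm theory, clarify the code logic, and walk through practical decision AI applications)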

Python 2,417 205 Updated Mar 13, 2025
Python 749 49 Updated Sep 3, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,565 432 Updated Oct 24, 2025

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 122 11 Updated Sep 26, 2025

A summary of existing representative LLM text datasets.

1,368 136 Updated Oct 11, 2025

The RedStone repository includes code for preparing extensive datasets used in training large language models.

Python 143 11 Updated Jun 30, 2025

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,611 1,600 Updated Oct 25, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,342 339 Updated Jul 12, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,843 375 Updated Oct 17, 2025

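This project is an open-source quantitative finance course that helps people quickly learn quantitative finance and build the skills to do quantitative development in Python.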

Jupyter Notebook 1,639 222 Updated May 21, 2025

A live reading list for LLM data synthesis (Updated to July, 2025).

398 35 Updated Aug 26, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,444 87 Updated Jun 5, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,231 805 Updated Oct 23, 2025

A library for advanced large language model reasoning

Python 2,292 201 Updated Jun 10, 2025

Modeling, training, eval, and inference code for OLMo

Python 6,054 664 Updated Oct 24, 2025
Python 355 42 Updated Jun 13, 2024

DSPy: The framework for programming—not prompting—language models

Python 29,502 2,352 Updated Oct 25, 2025

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Python 752 59 Updated May 22, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,395 200 Updated May 7, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,256 11,221 Updated Oct 22, 2025

Let your Claude think

TypeScript 16,233 1,909 Updated Mar 10, 2025