aojugg

aojugg

2 followers · 3 following

Stars

changyeyu / LLM-RL-Visualized

🌟100+ 原创 LLM / RL 原理图📚，《大模型算法》作者巨献！💥（100+ LLM/RL Algorithm Maps ）

Python 1,654 179 Updated Oct 20, 2025

QwenLM / Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,079 1,102 Updated Sep 26, 2025

shareAI-lab / analysis_claude_code

本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档，以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。

JavaScript 10,972 2,888 Updated Jul 19, 2025

shareAI-lab / share-best-prompt

Forked from x1xhlol/system-prompts-and-models-of-ai-tools

世界上最好的提示词（总计估值超过300亿的提示词）外国网友x1xh成功获取了 v0、Manus、Cursor、Same.dev 和 Lovable 的完整官方系统提示词和内部工具。

Python 246 44 Updated Apr 15, 2025

0russwest0 / Awesome-Agent-RL

414 14 Updated Oct 11, 2025

langfengQ / verl-agent

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,078 91 Updated Oct 20, 2025

cmriat / l0

A scalable, end-to-end training pipeline for general-purpose agents

Python 360 54 Updated Jul 4, 2025

zzli2022 / Awesome-System2-Reasoning-LLM

Latest Advances on System-2 Reasoning

Python 1,258 72 Updated Jun 8, 2025

GeeeekExplorer / nano-vllm

Nano vLLM

Python 7,196 926 Updated Aug 31, 2025

opendilab / PPOxFamily

PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）

Python 2,417 205 Updated Mar 13, 2025

Qihoo360 / Light-R1

Python 749 49 Updated Sep 3, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,565 432 Updated Oct 24, 2025

inclusionAI / PromptCoT

A unified suite for generating elite reasoning problems and training high-performance LLMs, including pioneering attention-free architectures

Python 122 11 Updated Sep 26, 2025

lmmlzn / Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

1,368 136 Updated Oct 11, 2025

microsoft / RedStone

The RedStone repository includes code for preparing extensive datasets used in training large language models.

Python 143 11 Updated Jun 30, 2025

camel-ai / camel

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 14,611 1,600 Updated Oct 25, 2025

NovaSky-AI / SkyThought

Sky-T1: Train your own O1 preview model within $450

Python 3,342 339 Updated Jul 12, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,843 375 Updated Oct 17, 2025

datawhalechina / whale-quant

本项目为量化开源课程，可以帮助人们快速掌握量化金融知识以及使用Python进行量化开发的能力。

Jupyter Notebook 1,639 222 Updated May 21, 2025

pengr / LLM-Synthetic-Data

A live reading list for LLM data synthesis (Updated to July, 2025).

398 35 Updated Aug 26, 2025

wasiahmad / Awesome-LLM-Synthetic-Data

A reading list on LLM based Synthetic Data Generation 🔥

1,444 87 Updated Jun 5, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,231 805 Updated Oct 23, 2025

maitrix-org / llm-reasoners

A library for advanced large language model reasoning

Python 2,292 201 Updated Jun 10, 2025

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 6,054 664 Updated Oct 24, 2025

FlagOpen / FlagData

Python 355 42 Updated Jun 13, 2024

stanfordnlp / dspy

DSPy: The framework for programming—not prompting—language models

Python 29,502 2,352 Updated Oct 25, 2025

IntelLabs / RAG-FiT

Framework for enhancing LLMs for RAG tasks using fine-tuning.

Python 752 59 Updated May 22, 2025

atfortes / Awesome-LLM-Reasoning

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,395 200 Updated May 7, 2025

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,256 11,221 Updated Oct 22, 2025

richards199999 / Thinking-Claude

Let your Claude able to think

TypeScript 16,233 1,909 Updated Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly