Skip to content
View MikeDean2367's full-sized avatar

Block or report MikeDean2367

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 1,464 132 Updated Nov 23, 2025

The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

Python 387 12 Updated Jul 11, 2025

Self-Adapting Language Models

Python 1,537 271 Updated Aug 1, 2025

DeepEvolve is a research and coding agent for new algorithm discovery in different science domains with Deep Research and AlphaEvolve.

Python 95 10 Updated Oct 11, 2025

OceanGym: A Benchmark Environment for Underwater Embodied Agents

Python 40 4 Updated Nov 11, 2025

pathlib api extended to use fsspec backends

Python 345 46 Updated Nov 19, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 4,023 229 Updated Nov 5, 2025

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coo…

JavaScript 2,909 395 Updated Sep 29, 2025

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,019 101 Updated Nov 21, 2025

本仓库包含对 Claude Code v1.0.33 进行逆向工程的完整研究和分析资料。包括对混淆源代码的深度技术分析、系统架构文档,以及重构 Claude Code agent 系统的实现蓝图。主要发现包括实时 Steering 机制、多 Agent 架构、智能上下文管理和工具执行管道。该项目为理解现代 AI agent 系统设计和实现提供技术参考。

JavaScript 11,360 2,977 Updated Jul 19, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,758 449 Updated Nov 23, 2025

🏆 ICML 2025 Spotlight

Python 333 18 Updated Jul 14, 2025

What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?

TypeScript 16,331 1,249 Updated Sep 21, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 16,387 2,617 Updated Nov 23, 2025

Development environments for coding agents. Enable multiple agents to work safely and independently with your preferred stack.

Go 3,274 171 Updated Nov 10, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA (+ more DSLs)

Python 674 90 Updated Nov 21, 2025

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 8,976 1,006 Updated Aug 13, 2025

LookAhead Tuning: Safer Language Models via Partial Answer Previews

Python 16 Updated Mar 26, 2025

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,609 425 Updated Oct 16, 2025

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

Python 12,435 1,145 Updated Sep 26, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,466 2,899 Updated Nov 23, 2025

[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression

Python 123 5 Updated Apr 12, 2025

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Python 38 5 Updated Oct 10, 2025

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 17,833 1,887 Updated Nov 17, 2025

MLGym A New Framework and Benchmark for Advancing AI Research Agents

Python 574 55 Updated Aug 10, 2025

Automatically update arXiv papers about LLM Reasoning, LLM Evaluation, LLM & MLLM and Video Understanding using Github Actions.

Python 128 9 Updated Nov 23, 2025

This is the reading list of Large Language Model-Based Data Science Agent

35 3 Updated Nov 3, 2025

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 426 42 Updated Sep 30, 2024

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 8,269 634 Updated Sep 22, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1,234 97 Updated Jun 23, 2025
Next