Skip to content
View xlrrrr's full-sized avatar
🌳
Focusing
🌳
Focusing
  • University of Hong Kong

Block or report xlrrrr

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 180 6 Updated Oct 14, 2025

"LightReasoner: Can Small Language Models Teach Large Language Models Reasoning?"

Python 56 9 Updated Oct 14, 2025

"DeepResearch-Eval: An End-to-End Evaluation Framework for DeepResearch Systems"

Python 18 2 Updated Oct 13, 2025

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 604 58 Updated Oct 14, 2025

Awesome-GraphRAG: A curated list of resources (surveys, papers, benchmarks, and opensource projects) on graph-based retrieval-augmented generation.

1,701 156 Updated Sep 20, 2025

Build resilient language agents as graphs.

Python 19,771 3,479 Updated Oct 14, 2025

ScaleCUA is the open-sourced computer use agents that can operate on corss-platform environments (Windows, macOS, Ubuntu, Android).

Python 634 36 Updated Oct 3, 2025
Python 94 10 Updated Oct 13, 2025

Paper2Agent is a multi-agent AI system that automatically transforms research papers into interactive AI agents with minimal human input.

Jupyter Notebook 1,537 248 Updated Sep 21, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,575 579 Updated Oct 9, 2025

MCP Server for Computer Use in Windows

Python 3,136 352 Updated Oct 14, 2025

"VideoRAG: Chat with Your Videos"

Python 1,191 172 Updated Sep 12, 2025

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 1,880 135 Updated Oct 14, 2025

[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

1,160 68 Updated Oct 11, 2025

MiroMind Research Agent: Fully Open-Source Deep Research Agent with Reproducible State-of-the-Art Performance on FutureX, GAIA, HLE, BrowserComp and xBench.

Python 748 78 Updated Oct 14, 2025

OpenCUA: Open Foundations for Computer-Use Agents

Python 515 62 Updated Oct 12, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,814 1,843 Updated Oct 6, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,894 210 Updated Aug 15, 2025

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.

Python 158 2 Updated Sep 23, 2025

Open-source, secure environment with real-world tools for enterprise-grade agents.

MDX 9,654 668 Updated Oct 14, 2025
Python 135 10 Updated Oct 14, 2025

Fetch an entire site and use it as an MCP Server

TypeScript 723 40 Updated Aug 27, 2025

"VideoAgent: All-in-One Agentic Framework for Video Understanding, Editing, and Remaking"

Python 226 35 Updated Oct 11, 2025

"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"

Python 7,613 1,074 Updated Oct 11, 2025
Jupyter Notebook 171 7 Updated May 16, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,508 2,184 Updated Oct 14, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,613 357 Updated Aug 29, 2025

TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature.

Python 3,003 251 Updated Jul 25, 2025
Python 103 7 Updated Oct 3, 2025

Profile-Based Long-Term Memory for AI Applications. Memobase handles user profiles, memory events, and evolving context — perfect for chatbots, companions, tutors, customer service bots, and all ch…

Python 2,225 165 Updated Oct 8, 2025
Next