Skip to content
View an-yongqi's full-sized avatar

Highlights

  • Pro

Block or report an-yongqi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Recovery-Bench is a benchmark for evaluating the capability of LLM agents to recover from mistakes

Python 8 3 Updated Sep 11, 2025

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Python 19,066 1,989 Updated Oct 24, 2025

📑 PageIndex: Document Index for Reasoning-based RAG

Python 3,736 271 Updated Nov 5, 2025

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 656 40 Updated Jul 22, 2024

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,958 1,294 Updated Nov 3, 2025

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 27,081 2,716 Updated Nov 5, 2025

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 17,909 2,230 Updated Nov 6, 2025

WideSearch: Benchmarking Agentic Broad Info-Seeking

Python 99 10 Updated Oct 9, 2025

Trae Agent is an LLM-based agent for general purpose software engineering tasks.

Python 9,907 1,024 Updated Sep 24, 2025

The absolute trainer to light up AI agents.

Python 7,245 551 Updated Nov 6, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,390 244 Updated Nov 6, 2025

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 41,507 2,716 Updated Nov 5, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,113 1,905 Updated Nov 1, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 14,370 2,181 Updated Nov 6, 2025

Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"

Python 104 8 Updated Oct 11, 2025

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

679 33 Updated Oct 20, 2025

Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible experimentation and exploration.

Python 84 8 Updated Oct 16, 2025

Gemini is a modern LaTex beamerposter theme 🖼

TeX 1,159 273 Updated Nov 2, 2025

Tools for merging pretrained large language models.

Python 6,435 631 Updated Oct 31, 2025

Scaling RL on advanced reasoning models

Python 629 39 Updated Oct 20, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,972 260 Updated Jul 7, 2025

Nano vLLM

Python 8,354 1,020 Updated Nov 3, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,759 293 Updated Nov 6, 2025

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 248 25 Updated Jan 31, 2025

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Python 44 3 Updated Jul 17, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,174 2,438 Updated Nov 6, 2025

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filli…

Python 1,147 63 Updated Sep 30, 2025

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

Python 152 9 Updated Oct 13, 2025
Python 30 4 Updated Mar 17, 2025
Next