Skip to content
View fishmingyu's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@dgSPARSE @MLSys-UCSD

Block or report fishmingyu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,895 316 Updated Jan 6, 2026

multilspy is a lsp client library in Python intended to be used to build applications around language servers.

Python 518 94 Updated Sep 3, 2025

Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

Rust 5,802 425 Updated Jan 12, 2026

Submit stacked diffs to GitHub on the command line

Python 897 78 Updated Dec 30, 2025

Building the Virtuous Cycle for AI-driven LLM Systems

Python 113 17 Updated Jan 11, 2026

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 18,516 1,261 Updated Jan 11, 2026

Repo2Run is an LLM-based agent that automates environment configuration by generating error-free Dockerfiles for Python repositories.

Python 147 23 Updated Nov 18, 2025

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,047 231 Updated Oct 16, 2025

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 568 81 Updated Dec 26, 2025

The memory-first coding agent

TypeScript 833 96 Updated Jan 12, 2026

A next generation Python CMake adaptor and Python API for plugins

Python 430 80 Updated Jan 9, 2026

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

C++ 3,598 197 Updated Jan 12, 2026

Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages

Rust 5,291 372 Updated Jan 12, 2026

a size profiler for cuda binary

Python 69 Updated Oct 7, 2025

A Lightweight LLM Post-Training Library

Python 2,104 220 Updated Jan 10, 2026

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 386 44 Updated Nov 20, 2025

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

Python 58 10 Updated Dec 18, 2025

A clean, modular SDK for building AI agents with OpenHands V1.

Python 413 106 Updated Jan 12, 2026

Fast low-bit matmul kernels in Triton

Python 420 31 Updated Dec 18, 2025

Secure environments for developers and their agents

Go 11,943 1,135 Updated Jan 12, 2026

Command line AI assistant written in Nim

Nim 22 Updated Dec 15, 2025

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,092 113 Updated Jan 12, 2026

[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

Python 61 3 Updated Dec 23, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,330 12,535 Updated Jan 12, 2026

The code repository of the paper: Competition and Attraction Improve Model Fusion

Jupyter Notebook 168 33 Updated Aug 25, 2025

AIRA-dojo: a framework for developing and evaluating AI research agents

Python 123 22 Updated Nov 17, 2025

Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 112 19 Updated Jan 1, 2026

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,845 1,035 Updated Dec 4, 2025
Python 27 3 Updated Mar 24, 2025
Next