Leo Yu fishmingyu

🎯

Focusing

A current Ph.D. student in the Department of Computer Science and Engineering, University of California, San Diego

56 followers · 44 following

San Diego
fishmingyu.github.io

Lists (1)

🔮 Future ideas

Stars

sgl-project / mini-sglang

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,895 316 Updated Jan 6, 2026

microsoft / multilspy

multilspy is an LSP client library in Python intended to be used to build applications around language servers.

Python 518 94 Updated Sep 3, 2025

cocoindex-io / cocoindex

Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!

Rust 5,802 425 Updated Jan 12, 2026

ezyang / ghstack

Submit stacked diffs to GitHub on the command line

Python 897 78 Updated Dec 30, 2025

flashinfer-ai / flashinfer-bench

Building the Virtuous Cycle for AI-driven LLM Systems

Python 113 17 Updated Jan 11, 2026

oraios / serena

A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)

Python 18,516 1,261 Updated Jan 11, 2026

bytedance / Repo2Run

Repo2Run is an LLM-based agent that automates environment configuration by generating error-free Dockerfiles for Python repositories.

Python 147 23 Updated Nov 18, 2025

beir-cellar / beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 2,047 231 Updated Oct 16, 2025

castorini / rank_llm

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 568 81 Updated Dec 26, 2025

letta-ai / letta-code

The memory-first coding agent

TypeScript 833 96 Updated Jan 12, 2026

scikit-build / scikit-build-core

A next generation Python CMake adaptor and Python API for plugins

Python 430 80 Updated Jan 9, 2026

memgraph / memgraph

Open-source graph database, tuned for dynamic analytics environments. Easy to adopt, scale and own.

C++ 3,598 197 Updated Jan 12, 2026

PyO3 / maturin

Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages

Rust 5,291 372 Updated Jan 12, 2026

flashinfer-ai / cubloaty

A size profiler for CUDA binaries

Python 69 Updated Oct 7, 2025

google / tunix

A Lightweight LLM Post-Training Library

Python 2,104 220 Updated Jan 10, 2026

TsinghuaC3I / MARTI

A Framework for LLM-based Multi-Agent Reinforced Training and Inference

Python 386 44 Updated Nov 20, 2025

lmgame-org / GRL

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

Python 58 10 Updated Dec 18, 2025

OpenHands / software-agent-sdk

A clean, modular SDK for building AI agents with OpenHands V1.

Python 413 106 Updated Jan 12, 2026

dropbox / gemlite

Fast low-bit matmul kernels in Triton

Python 420 31 Updated Dec 18, 2025

coder / coder

Secure environments for developers and their agents

Go 11,943 1,135 Updated Jan 12, 2026

gokr / niffler

Command line AI assistant written in Nim

Nim 22 Updated Dec 15, 2025

inclusionAI / AWorld

Build, evaluate and train General Multi-Agent Assistance with ease

Python 1,092 113 Updated Jan 12, 2026

gso-bench / gso

[NeurIPS '25] GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents

Python 61 3 Updated Dec 23, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,330 12,535 Updated Jan 12, 2026

SakanaAI / natural_niches

The code repository of the paper: Competition and Attraction Improve Model Fusion

Jupyter Notebook 168 33 Updated Aug 25, 2025

facebookresearch / aira-dojo

AIRA-dojo: a framework for developing and evaluating AI research agents

Python 123 22 Updated Nov 17, 2025

microsoft / RetrievalAttention

Scalable long-context LLM decoding that leverages sparsity—by treating the KV cache as a vector storage system.

Python 112 19 Updated Jan 1, 2026

langchain-ai / open_deep_research

Python 10,166 1,480 Updated Aug 27, 2025

QwenLM / Qwen3-Coder

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,845 1,035 Updated Dec 4, 2025

PanZaifeng / FastTree-Artifact

Python 27 3 Updated Mar 24, 2025