View lbh2001's full-sized avatar
🎣
Fishing

Organizations

@bullfrog-store


This repo releases the detailed benchmark code and results of Sea Labs AI.

Python 11 1 Updated Jan 3, 2026

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,908 319 Updated Jan 6, 2026

Header-only C++ binding for libzmq

C++ 2,249 795 Updated Dec 19, 2025

Flash Attention from Scratch on CUDA Ampere

Assembly 116 14 Updated Sep 1, 2025

[ICLR2025] Codebase for "ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing", built on Megatron-LM.

Python 104 9 Updated Dec 20, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/accelerator kernels

Python 4,655 393 Updated Jan 13, 2026

Nano vLLM

Python 10,726 1,376 Updated Nov 3, 2025

This repository is responsible for the LLVM-related parts of Jeandle.

LLVM 145 27 Updated Dec 26, 2025

Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…

Java 394 52 Updated Jan 13, 2026

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 2,248 252 Updated Jan 13, 2026

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 40,743 7,104 Updated Jan 13, 2026

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python 251 45 Updated Jan 12, 2026

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,565 501 Updated Jan 13, 2026

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,514 647 Updated Jan 12, 2026

How to optimize some algorithms in CUDA.

Cuda 2,754 250 Updated Jan 8, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,409 12,560 Updated Jan 13, 2026

From a nobody to a large language model (LLM) hero~ Stay tuned for what comes next!!!

Jupyter Notebook 1,937 131 Updated Nov 22, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,368 4,032 Updated Jan 13, 2026

FlashInfer: Kernel Library for LLM Serving

Python 4,626 644 Updated Jan 13, 2026

My learning notes for ML SYS.

Python 5,029 328 Updated Jan 8, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,379 1,109 Updated Jan 9, 2026

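PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (core framework of PaddlePaddle 『飞桨』: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)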

C++ 23,569 5,918 Updated Jan 13, 2026

brpc is an industrial-grade RPC framework using the C++ language, which is often used in high-performance systems such as search, storage, machine learning, advertisement, recommendation, etc. "brpc" mea…

C++ 17,441 4,091 Updated Jan 13, 2026

📝A simple and elegant markdown editor, available for Linux, macOS and Windows.

JavaScript 53,191 3,926 Updated Nov 19, 2025

Tech Enthusiasts Weekly, published every Friday

82,334 3,830 Updated Jan 9, 2026

Scalable NameNode RPC Proxy for HDFS Federation

Java 86 16 Updated Apr 19, 2016

A Vector Database Tutorial (over CMU-DB's BusTub system)

C++ 740 23 Updated Jan 19, 2025

The official home of the Presto distributed SQL query engine for big data

Java 16,620 5,515 Updated Jan 13, 2026

Labor dispute lawsuit between He Shijun (贺师俊) and 360

2,435 160 Updated Mar 19, 2024

A lightweight RPC implementation of the Google protobuf RPC framework.

C++ 2,148 653 Updated Aug 24, 2023