Organizations

@kubernetes @Project-HAMi

Starred repositories

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,097 101 Updated Nov 26, 2025

Perplexity open source garden for inference technology

Rust 269 20 Updated Nov 20, 2025

My learning notes/codes for ML SYS.

Python 4,279 259 Updated Nov 25, 2025

Intelligent Router for Mixture-of-Models

Rust 2,335 298 Updated Nov 26, 2025

💖🧸 Self-hosted, you-owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

Vue 15,816 1,440 Updated Nov 26, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,214 753 Updated Nov 26, 2025
Go 6 5 Updated Nov 24, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,984 328 Updated Nov 21, 2025
Smarty 8 Updated May 26, 2025
Go 40 6 Updated Nov 24, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,435 489 Updated Nov 25, 2025

Fast OS-level support for GPU checkpoint and restore

C++ 256 26 Updated Sep 28, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,945 1,157 Updated Nov 26, 2025

Intercept gRPC traffic of containerd with eBPF

Go 2 Updated Jan 23, 2024

This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.

C++ 56 4 Updated Mar 5, 2025

AIOS: AI Agent Operating System

Python 4,816 625 Updated Nov 24, 2025

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go 270 44 Updated Nov 24, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,697 2,559 Updated Nov 26, 2025

DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding applications in Kubernetes.

Go 151 26 Updated Nov 25, 2025

Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)

Go 2,677 426 Updated Nov 26, 2025

Visualize your multi-stage Dockerfiles

Go 241 15 Updated Nov 24, 2025

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 715 63 Updated Jan 7, 2024

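❤️ A loving AI pair-programming tool: 🐌 Guii Devtool integrates easily into existing front-end projects and lets you customize and optimize code through natural-language instructions. We do not replace creators or hackers; we only want to be a close companion at their desks, ✨ building great products together.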

212 Updated Jul 30, 2024

groupcache is a caching and cache-filling library, intended as a replacement for memcached in many cases.

Go 13,282 1,395 Updated Nov 29, 2024

Unlock Unlimited Potential! Share Your GPU Power Across Your Local Network!

Go 71 3 Updated May 22, 2025

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 32,333 1,930 Updated Nov 17, 2025

LLM inference in C/C++

C++ 90,444 13,835 Updated Nov 26, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,624 13,754 Updated Nov 26, 2025

🐶 Kubernetes CLI To Manage Your Clusters In Style!

Go 31,944 2,013 Updated Nov 25, 2025

A collection of community maintained NRI plugins

Go 97 31 Updated Nov 22, 2025