Organizations

@kubernetes @Project-HAMi

Starred repositories

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 1,096 101 Updated Nov 25, 2025

Perplexity open source garden for inference technology

Rust 265 20 Updated Nov 20, 2025

My learning notes/codes for ML SYS.

Python 4,272 259 Updated Nov 22, 2025

Intelligent Router for Mixture-of-Models

Rust 2,330 297 Updated Nov 25, 2025

💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

Vue 15,808 1,437 Updated Nov 25, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,202 748 Updated Nov 25, 2025
Go 6 5 Updated Nov 24, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,980 327 Updated Nov 21, 2025
Smarty 8 Updated May 26, 2025
Go 40 6 Updated Nov 24, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,434 487 Updated Nov 25, 2025

Fast OS-level support for GPU checkpoint and restore

C++ 256 26 Updated Sep 28, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,925 1,155 Updated Nov 25, 2025

Intercept gRPC traffic of containerd with eBPF

Go 2 Updated Jan 23, 2024

This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.

C++ 56 4 Updated Mar 5, 2025

AIOS: AI Agent Operating System

Python 4,810 624 Updated Nov 24, 2025

☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!

Go 270 44 Updated Nov 24, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,691 2,558 Updated Nov 23, 2025

DRANET is a Kubernetes Network Driver that uses Dynamic Resource Allocation (DRA) to deliver high-performance networking for demanding applications in Kubernetes.

Go 148 26 Updated Nov 25, 2025

Heterogeneous AI Computing Virtualization Middleware (Project under CNCF)

Go 2,671 424 Updated Nov 21, 2025

Visualize your multi-stage Dockerfiles

Go 240 15 Updated Nov 24, 2025

Implementation of plug in and play Attention from "LongNet: Scaling Transformers to 1,000,000,000 Tokens"

Python 715 63 Updated Jan 7, 2024

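❤️ An AI pair-programming tool made with love: 🐌 Guii Devtool integrates easily into existing front-end projects and lets you customize and optimize code through natural-language instructions. We do not replace creators or hackers; we only want to be a close companion at their desk, ✨ building great products together.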

212 Updated Jul 30, 2024

groupcache is a caching and cache-filling library, intended as a replacement for memcached in many cases.

Go 13,282 1,395 Updated Nov 29, 2024

Unlock Unlimited Potential! Share Your GPU Power Across Your Local Network!

Go 71 3 Updated May 22, 2025

This is the Rust course used by the Android team at Google. It provides you the material to quickly teach Rust.

Rust 32,327 1,930 Updated Nov 17, 2025

LLM inference in C/C++

C++ 90,390 13,823 Updated Nov 25, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,558 13,741 Updated Nov 22, 2025

🐶 Kubernetes CLI To Manage Your Clusters In Style!

Go 31,933 2,013 Updated Nov 25, 2025

A collection of community maintained NRI plugins

Go 97 31 Updated Nov 22, 2025