- Chengdu Sichuan China
- https://lengrongfu.github.io/
-
dce-charts-repackage Public
Forked from DaoCloud/dce-charts-repackagehelm repo add daocloud https://daocloud.github.io/dce-charts-repackage/
Mustache UpdatedNov 26, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedNov 26, 2025 -
HAMi Public
Forked from Project-HAMi/HAMiOpenAIOS vGPU scheduler for Kubernetes is originated from the OpenAIOS project to virtualize GPU device memory.
Go Apache License 2.0 UpdatedNov 25, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedNov 10, 2025 -
semantic-router Public
Forked from vllm-project/semantic-routerIntelligent Mixture-of-Models Router for Efficient LLM Inference
Go Apache License 2.0 UpdatedSep 22, 2025 -
candle Public
Forked from huggingface/candleMinimalist ML framework for Rust
Rust Apache License 2.0 UpdatedSep 12, 2025 -
LMCache Public
Forked from LMCache/LMCacheRedis for LLMs
Python Apache License 2.0 UpdatedSep 5, 2025 -
dynamo Public
Forked from ai-dynamo/dynamoA Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 UpdatedSep 5, 2025 -
modelscope Public
Forked from modelscope/modelscopeModelScope: bring the notion of Model-as-a-Service to life.
Python Apache License 2.0 UpdatedAug 25, 2025 -
k8s-dra-driver Public
Forked from NVIDIA/k8s-dra-driver-gpuDynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes
Go Apache License 2.0 UpdatedAug 14, 2025 -
vllm-ascend Public
Forked from vllm-project/vllm-ascendCommunity maintained hardware plugin for vLLM on Ascend
Python Apache License 2.0 UpdatedJul 3, 2025 -
snapshots-quota Public
Containerd snapshots quota NRI plugin, user can set every container ephemeral storage, but in ephemeral storage use full pod will not restart.
-
-
HAMi-core Public
Forked from Project-HAMi/HAMi-coreHAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container
C UpdatedMay 26, 2025 -
nerdctl Public
Forked from containerd/nerdctlcontaiNERD CTL - Docker-compatible CLI for containerd, with support for Compose, Rootless, eStargz, OCIcrypt, IPFS, ...
Go Apache License 2.0 UpdatedMay 22, 2025 -
-
production-stack Public
Forked from vllm-project/production-stackvLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
Python Apache License 2.0 UpdatedMay 9, 2025 -
LLMDistribution Public
LLMDistribution is a local model distribution server.
-
-
containerd Public
Forked from containerd/containerdAn open and reliable container runtime
-
kubernetes Public
Forked from kubernetes/kubernetesProduction-Grade Container Scheduling and Management
Go Apache License 2.0 UpdatedApr 2, 2025 -
cloudtty Public
Forked from cloudtty/cloudttyA Friendly Kubernetes CloudShell (Web Terminal) !
Go MIT License UpdatedMar 19, 2025 -
-
-
mpu Public
Forked from limstash/mpuA shim driver allows in-docker nvidia-smi showing correct process list without modify anything
-
DualPipe Public
Forked from deepseek-ai/DualPipeA bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
Python MIT License UpdatedMar 5, 2025 -
-
-
study-demo Public
A collection of hands-on demos and code snippets for learning and experimenting with various programming concepts, frameworks, and tools. Ideal for self-study and quick references.
-
scuda Public
Forked from kevmo314/scudaSCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines.
C++ Apache License 2.0 UpdatedJan 23, 2025