-
fluss-rust Public
Forked from apache/fluss-rustRust Client for Apache Fluss (Incubating)
Rust Apache License 2.0 UpdatedNov 23, 2025 -
TransferQueue Public
Forked from 0oshowero0/TransferQueuePython Apache License 2.0 UpdatedNov 11, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedOct 27, 2025 -
nanochat Public
Forked from karpathy/nanochatThe best ChatGPT that $100 can buy.
Python MIT License UpdatedOct 25, 2025 -
3FS Public
Forked from deepseek-ai/3FSA high-performance distributed file system designed to address the challenges of AI training and inference workloads.
C++ MIT License UpdatedSep 7, 2025 -
dynamo Public
Forked from ai-dynamo/dynamoA Datacenter Scale Distributed Inference Serving Framework
Rust Apache License 2.0 UpdatedAug 27, 2025 -
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJul 15, 2025 -
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++ Apache License 2.0 UpdatedJul 2, 2025 -
nano-vllm Public
Forked from GeeeekExplorer/nano-vllmNano vLLM
Python MIT License UpdatedJun 27, 2025 -
uccl Public
Forked from uccl-project/ucclUltra and Unified CCL
C++ Apache License 2.0 UpdatedJun 26, 2025 -
ray Public
Forked from ray-project/rayRay is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Python Apache License 2.0 UpdatedJun 24, 2025 -
cxx Public
Forked from dtolnay/cxxSafe interop between Rust and C++
Rust Apache License 2.0 UpdatedJun 23, 2025 -
claude-code-router Public
Forked from musistudio/claude-code-routerUse Claude Code as the foundation for coding infrastructure, allowing you to decide how to interact with the model while enjoying updates from Anthropic.
TypeScript MIT License UpdatedJun 22, 2025 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedJun 22, 2025 -
slime Public
Forked from THUDM/slimeslime is a LLM post-training framework aiming at scaling RL.
Python Apache License 2.0 UpdatedJun 19, 2025 -
volo Public
Forked from cloudwego/voloRust RPC framework with high-performance and strong-extensibility for building micro-services.
Rust Apache License 2.0 UpdatedJun 19, 2025 -
oneflow Public
Forked from Oneflow-Inc/oneflowOneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
C++ Apache License 2.0 UpdatedJun 17, 2025 -
opendal Public
Forked from apache/opendalApache OpenDAL: One Layer, All Storage.
Rust Apache License 2.0 UpdatedJun 16, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedJun 14, 2025 -
circuit-tracer Public
Forked from safety-research/circuit-tracerJavaScript MIT License UpdatedJun 11, 2025 -
Daft Public
Forked from Eventual-Inc/DaftDistributed data engine for Python/SQL designed for the cloud, powered by Rust
Rust Apache License 2.0 UpdatedJun 8, 2025 -
cudarc Public
Forked from chelsea0x3b/cudarcSafe rust wrapper around CUDA toolkit
Rust Apache License 2.0 UpdatedJun 5, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedJun 4, 2025 -
nixl Public
Forked from ai-dynamo/nixlNVIDIA Inference Xfer Library (NIXL)
C++ Apache License 2.0 UpdatedJun 3, 2025 -
aibrix Public
Forked from vllm-project/aibrixCost-efficient and pluggable Infrastructure components for GenAI inference
Jupyter Notebook Apache License 2.0 UpdatedJun 2, 2025 -
rocksdb Public
Forked from facebook/rocksdbA library that provides an embeddable, persistent key-value store for fast storage.
C++ GNU General Public License v2.0 UpdatedMay 30, 2025 -
llumnix Public
Forked from AlibabaPAI/llumnixEfficient and easy multi-instance LLM serving
Python Apache License 2.0 UpdatedMay 28, 2025 -
perf_analyzer Public
Forked from triton-inference-server/perf_analyzer -
llama.cpp Public
Forked from ggml-org/llama.cppLLM inference in C/C++
C++ MIT License UpdatedMay 27, 2025