Stars
Current and Historical Lists of S&P 500 components since 1996
nanobind: tiny and efficient C++/Python bindings
Probably the fastest coroutine lib in the world!
💫 Toolkit to help you get started with Spec-Driven Development
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
High-performance distributed multi-tier cache system. Built in Rust.
CommonMark parsing and rendering library and program in C
Financial data platform for analysts, quants and AI agents.
SGLang is a fast serving framework for large language models and multi-modality models.
Stable Diffusion web UI
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Simple, safe way to store and distribute tensors
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence…