Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
LLM notes covering model inference, transformer model structure, and LLM framework code analysis.
Simple, unified interface to multiple Generative AI providers
The lightweight, user-friendly, fault-tolerant database built on SQLite.
An industrial-grade C++ implementation of the Raft consensus algorithm based on brpc, widely used inside Baidu to build highly available distributed systems.
brpc is an industrial-grade RPC framework written in C++, often used in high-performance systems such as search, storage, machine learning, advertisement, and recommendation. "brpc" mea…
Apache Doris is an easy-to-use, high-performance, unified analytics database.
dperf: High-Performance Network Load Testing Tool Based on DPDK
MinIO is a high-performance, S3-compatible object store, open-sourced under the GNU AGPLv3 license.
RETIRED, Monasca REST API. Mirror of code maintained at opendev.org.