Stars
FlagGems is an operator library for large language models implemented in the Triton Language.
vLLM Kunlun (vllm-kunlun) is a community-maintained hardware plugin designed to seamlessly run vLLM on the Kunlun XPU.
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A high-throughput and memory-efficient inference and serving engine for LLMs
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
A book for Learning the Foundations of LLMs
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Learn Low Level Design (LLD) and prepare for interviews using free resources.
ImageMagick is a free, open-source software suite for creating, editing, converting, and displaying images. It supports 200+ formats and offers powerful command-line tools and APIs for automation, …
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
A curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Learn System Design concepts and prepare for interviews using free resources.
Systems design is the process of defining the architecture, modules, interfaces, and data for a system to satisfy specified requirements. Systems design could be seen as the application of systems …
This is a Chinese translation of the CUDA programming guide
a software library containing BLAS functions written in OpenCL
📚 Freely available programming books
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…
ImageJS--Image analysis in the browser with ImageJ
Public domain software for processing and analyzing scientific images
Embedded graphics library to create beautiful UIs for any MCU, MPU and display type.
MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.