Stars
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
The Fastest DNN Running Framework on Web Browser
An open-source C++ library developed and used at Facebook.
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Simple and comprehensive tutorials in TensorFlow
a TensorFlow-based distributed training framework optimized for large-scale sparse data.
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Pre-built ARM/Linux C cross-compilers for MacOS
Library for specialized dense and sparse matrix operations, and deep learning primitives.
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
🚀 gnet is a high-performance, lightweight, non-blocking, event-driven networking framework written in pure Go.
🐜🐜🐜 ants is the most powerful and reliable pooling solution for Go.
benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…