Skip to content
View powerpwang's full-sized avatar

Block or report powerpwang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,296 4,686 Updated Jan 18, 2026

The Fastest DNN Running Framework on Web Browser

TypeScript 1,999 148 Updated Jun 7, 2025

An open-source C++ library developed and used at Facebook.

C++ 30,221 5,840 Updated Jan 18, 2026

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

C++ 11,986 1,873 Updated Jul 6, 2025

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Python 12,808 2,077 Updated Jan 23, 2024

Simple and comprehensive tutorials in TensorFlow

Python 2,860 364 Updated Jul 8, 2024

a TensorFlow-based distributed training framework optimized for large-scale sparse data.

C++ 333 71 Updated Dec 23, 2025

Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.

Cuda 631 142 Updated Sep 4, 2025

Abseil Common Libraries (C++)

C++ 16,948 2,951 Updated Jan 16, 2026

cuDF - GPU DataFrame Library

C++ 9,465 1,001 Updated Jan 19, 2026

HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training

C++ 1,041 205 Updated Sep 15, 2025

DyNet: The Dynamic Neural Network Toolkit

C++ 3,434 705 Updated Dec 1, 2023

Model Quantization Benchmark

Python 855 141 Updated Apr 20, 2025

Intel PMU profiling tools

Python 2,208 357 Updated Jan 8, 2026

System for AI Education Resource.

Python 4,206 520 Updated Oct 25, 2024

A primitive library for neural network

C++ 1,368 223 Updated Nov 24, 2024

Pre-built ARM/Linux C cross-compilers for MacOS

127 5 Updated May 16, 2022

Library for specialized dense and sparse matrix operations, and deep learning primitives.

C 933 199 Updated Jan 8, 2026

[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration

C++ 200 33 Updated Apr 27, 2022

🚀 gnet is a high-performance, lightweight, non-blocking, event-driven networking framework written in pure Go.

Go 11,049 1,105 Updated Dec 27, 2025

🐜🐜🐜 ants is the most powerful and reliable pooling solution for Go.

Go 14,262 1,428 Updated Dec 27, 2025
C++ 54 12 Updated Sep 23, 2020

benchmark for embededded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.

Python 204 29 Updated Feb 18, 2021

Vector class library, latest version

C++ 1,427 156 Updated Feb 1, 2024

TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its…

C++ 4,613 773 Updated May 9, 2025