TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is distinguished by several outstanding features, including its…

C++ · 4,593 stars · 772 forks · Updated May 9, 2025

facebookincubator / AITemplate

AITemplate is a Python framework that renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Python · 4,694 stars · 382 forks · Updated Oct 27, 2025

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ · 12,417 stars · 2,277 forks · Updated Nov 12, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ · 22,332 stars · 4,354 forks · Updated Nov 26, 2025

NVIDIA / FasterTransformer

Transformer-related optimization, including BERT and GPT

C++ · 6,354 stars · 923 forks · Updated Mar 27, 2024

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript · 31,884 stars · 3,033 forks · Updated Nov 27, 2025

neiltian neiltian-tencent

Stars

sgl-project / sglang

deepseek-ai / open-infra-index

deepseek-ai / DeepSeek-R1

deepseek-ai / DeepSeek-V3

zilliztech / GPTCache

vllm-project / llm-compressor

pytorch / pytorch

triton-lang / triton

siliconflow / onediff

microsoft / onnxruntime

hpcaitech / ColossalAI

Oneflow-Inc / oneflow

deepspeedai / DeepSpeed

Tencent / TNN

facebookincubator / AITemplate

NVIDIA / TensorRT

Tencent / ncnn

NVIDIA / FasterTransformer

lutzroeder / netron