-
FastDeploy Public
Forked from PaddlePaddle/FastDeployHigh-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle
Python Apache License 2.0 UpdatedNov 11, 2025 -
-
Paddle Public
Forked from PaddlePaddle/PaddlePArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
C++ Apache License 2.0 UpdatedOct 20, 2025 -
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
Cuda MIT License UpdatedSep 16, 2025 -
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJul 2, 2025 -
PaddleNLP Public
Forked from PaddlePaddle/PaddleNLP👑 Easy-to-use and powerful NLP library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Ques…
Python Apache License 2.0 UpdatedApr 10, 2025 -
PaddleMIX Public
Forked from PaddlePaddle/PaddleMIXPaddle Multimodal Integration and eXploration, supporting text-to-image, image generation, multi-modal CV tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model too…
Python Apache License 2.0 UpdatedFeb 25, 2025 -
Paddle-Lite Public
Forked from PaddlePaddle/Paddle-LiteMulti-platform high performance deep learning inference engine (『飞桨』多平台高性能深度学习预测引擎)
C++ Apache License 2.0 UpdatedAug 14, 2024 -
-
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedDec 27, 2023 -
-
PaddleTest Public
Forked from PaddlePaddle/PaddleTestPaddlePaddle TestSuite
Python UpdatedJun 5, 2023 -
-
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedApr 4, 2023 -
-
-
transformers-benchmarks Public
Forked from mli/transformers-benchmarksreal Transformer TeraFLOPS on various GPUs
Jupyter Notebook Apache License 2.0 UpdatedSep 2, 2022 -
-
YHs_Sample Public
Forked from Yinghan-Li/YHs_SampleYinghan's Code Sample
Cuda GNU General Public License v3.0 UpdatedJul 25, 2022 -
PaddleSeg Public
Forked from PaddlePaddle/PaddleSegEasy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
Python Apache License 2.0 UpdatedJul 11, 2022 -
-
Paddle-Inference-Demo Public
Forked from PaddlePaddle/Paddle-Inference-DemoC++ Apache License 2.0 UpdatedJun 23, 2022 -
-
Paddle-Lite-Demo Public
Forked from PaddlePaddle/Paddle-Lite-Demolib, demo, model, data
Java Apache License 2.0 UpdatedMay 6, 2022 -
trt-samples-for-hackathon-cn Public
Forked from NVIDIA/trt-samples-for-hackathon-cnSimple samples for TensorRT programming
Python Apache License 2.0 UpdatedApr 14, 2022 -