Zhejiang University
Hangzhou, China (UTC +08:00)
- BasicCUDA Public
  Forked from CalvinXKY/BasicCUDA
  A tutorial for CUDA & PyTorch
  C++ Updated Aug 26, 2024
- pytorch Public
  Forked from pytorch/pytorch
  Tensors and Dynamic neural networks in Python with strong GPU acceleration
  Python Other Updated Jun 14, 2024
- nccl Public
  Forked from NVIDIA/nccl
  Optimized primitives for collective multi-GPU communication
  C++ Other Updated Jun 4, 2024
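- Paddle Public
  Forked from PaddlePaddle/Paddle
  PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)
  C++ Apache License 2.0 Updated May 10, 2024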
- PaddleAutoProject Public
  Forked from PFCCLab/PaddleAutoProject
  PaddlePaddle-based automation feature development group
  Python Apache License 2.0 Updated Apr 15, 2024
- community Public
  Forked from PaddlePaddle/community
  PaddlePaddle Developer Community
  Jupyter Notebook Apache License 2.0 Updated Apr 15, 2024
- openmlsys-zh Public
  Forked from openmlsys/openmlsys-zh
  "Machine Learning Systems: Design and Implementation" - Chinese Version
  TeX Updated Apr 13, 2024
- CUDATutorial Public
  Forked from PaddleJitLab/CUDATutorial
  A self-learning tutorial for CUDA High Performance Programming.
  JavaScript Apache License 2.0 Updated Apr 10, 2024
- PaddleNLP Public
  Forked from PaddlePaddle/PaddleNLP
  👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search…
  Python Apache License 2.0 Updated Mar 29, 2024
- flash-attention Public
  Forked from Dao-AILab/flash-attention
  Fast and memory-efficient exact attention
  Python BSD 3-Clause "New" or "Revised" License Updated Jan 26, 2024
- vllm Public
  Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python Apache License 2.0 Updated Jan 26, 2024
- docs Public
  Forked from PaddlePaddle/docs
  Documentation for PaddlePaddle
  Python Apache License 2.0 Updated Jan 17, 2024
- MNN Public
  Forked from alibaba/MNN
  MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
  C++ Updated Dec 29, 2023
- mmpose Public
  Forked from open-mmlab/mmpose
  OpenMMLab Pose Estimation Toolbox and Benchmark.
  Python Apache License 2.0 Updated Sep 8, 2023
- oneflow Public
  Forked from Oneflow-Inc/oneflow
  OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
  C++ Apache License 2.0 Updated Jun 30, 2023
- ncnn Public
  Forked from Tencent/ncnn
  ncnn is a high-performance neural network inference framework optimized for the mobile platform
  C++ Other Updated May 19, 2023
- thrust Public
  Forked from NVIDIA/thrust
  The C++ parallel algorithms library.
  C++ Other Updated May 11, 2023
- mmdeploy Public
  Forked from open-mmlab/mmdeploy
  OpenMMLab Model Deployment Framework
  Python Apache License 2.0 Updated Apr 18, 2023
- mmaction2 Public
  Forked from open-mmlab/mmaction2
  OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
  Python Apache License 2.0 Updated Apr 16, 2023
- CUDALibrarySamples Public
  Forked from NVIDIA/CUDALibrarySamples
  CUDA Library Samples
  Cuda Other Updated Mar 28, 2023
- Needle Public
  A basic deep learning library, comparable to a very minimal version of PyTorch.
- AlphaPose Public
  Forked from MVIG-SJTU/AlphaPose
  Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
  Python Other Updated Aug 27, 2022
- learning-cuda-trt Public
  Forked from jinmin527/learning-cuda-trt
  A large number of cases for learning cuda/tensorrt.
  C++ MIT License Updated Jul 24, 2022