-
-
cupti-tutorial Public
Forked from eunomia-bpf/cupti-tutorialTutorials for NVIDIA CUPTI samples
C++ UpdatedNov 3, 2025 -
-
-
-
-
open-gpu-kernel-modules Public
Forked from NVIDIA/open-gpu-kernel-modulesNVIDIA Linux open GPU kernel module source
C Other UpdatedSep 28, 2025 -
GustANN Public
Forked from thustorage/GustANNHigh-Throughput, Cost-Effective Billion-Scale Vector Search with a Single GPU [to appear in SIGMOD'26]
Cuda UpdatedSep 26, 2025 -
open-gpu-doc Public
Forked from NVIDIA/open-gpu-docDocumentation of NVIDIA chip/hardware interfaces
C MIT License UpdatedAug 18, 2025 -
cuda_scheduling_examiner_mirror Public
Forked from JoshuaJB/cuda_scheduling_examiner_mirrorA tool for examining GPU scheduling behavior.
Cuda Other UpdatedJun 26, 2025 -
third_party Public
Forked from triton-inference-server/third_partyThird-party source packages that are modified for use in Triton.
C BSD 3-Clause "New" or "Revised" License UpdatedJun 22, 2025 -
onnxruntime_backend Public
Forked from triton-inference-server/onnxruntime_backendThe Triton backend for the ONNX Runtime.
C++ BSD 3-Clause "New" or "Revised" License UpdatedJun 18, 2025 -
pytorch_backend Public
Forked from triton-inference-server/pytorch_backendThe Triton backend for the PyTorch TorchScript models.
C++ BSD 3-Clause "New" or "Revised" License UpdatedJun 18, 2025 -
-
vllm_backend Public
Forked from triton-inference-server/vllm_backendPython BSD 3-Clause "New" or "Revised" License UpdatedJun 10, 2025 -
python_backend Public
Forked from triton-inference-server/python_backendTriton backend that enables pre-process, post-processing and other logic to be implemented in Python.
C++ BSD 3-Clause "New" or "Revised" License UpdatedJun 10, 2025 -
ann-benchmarks Public
Forked from erikbern/ann-benchmarksBenchmarks of approximate nearest neighbor libraries in Python
Python MIT License UpdatedJun 10, 2025 -
cuml Public
Forked from rapidsai/cumlcuML - RAPIDS Machine Learning Library
C++ Apache License 2.0 UpdatedJun 7, 2025 -
cudf Public
Forked from rapidsai/cudfcuDF - GPU DataFrame Library
C++ Apache License 2.0 UpdatedMay 31, 2025 -
-
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedMay 7, 2025 -
rust-openssl Public
Forked from rust-openssl/rust-opensslOpenSSL bindings for Rust
Rust UpdatedApr 30, 2025 -
llm-analysis Public
Forked from cli99/llm-analysisLatency and Memory Analysis of Transformer Models for Training and Inference
Python Apache License 2.0 UpdatedApr 19, 2025 -
-
cbomkit Public
Forked from cbomkit/cbomkitA toolset for dealing with Cryptography Bill of Materials (CBOM)
Java Apache License 2.0 UpdatedApr 9, 2025 -
CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]
Cuda UpdatedMar 3, 2025 -
cupynumeric Public
Forked from nv-legate/cupynumericAn Aspiring Drop-In Replacement for NumPy at Scale
Python Apache License 2.0 UpdatedFeb 7, 2025 -
legate-sparse Public
Forked from nv-legate/legate-sparseLegate Sparse is a Legate library that aims to provide a distributed and accelerated drop-in replacement for the scipy.sparse library on top of the Legate runtime
Python Apache License 2.0 UpdatedFeb 3, 2025 -
-
serve Public
Forked from pytorch/serveServe, optimize and scale PyTorch models in production
Java Apache License 2.0 UpdatedDec 2, 2024