-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJan 17, 2026 -
-
-
-
torch_mpi_ext Public
Forked from pytorch/extension-cppC++ extensions in PyTorch
Python UpdatedDec 26, 2025 -
xdsl Public
Forked from xdslproject/xdslA Python compiler design toolkit.
Python Other UpdatedNov 25, 2025 -
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedSep 22, 2025 -
-
-
-
-
mirage Public
Forked from mirage-project/mirageMirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
C++ Apache License 2.0 UpdatedDec 4, 2024 -
-
blaspp Public
Forked from icl-utk-edu/blasppBLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.
C++ BSD 3-Clause "New" or "Revised" License UpdatedJul 2, 2024 -
cgen Public
Forked from inducer/cgenC/C++ source generation from an AST
Python Other UpdatedJun 7, 2024 -
-
HeCBench Public
Forked from zjin-lcf/HeCBenchC++ BSD 3-Clause "New" or "Revised" License UpdatedFeb 14, 2024 -
gitlab-cmake-ci Public
Modern Cmake C++ project example, with codespell, cmake, cpppcheck clang-format clang-tidy lcov gcovr support.
-
multiarray Public
A simple C++ templated multiarray class for array, a header-only library
-
-
-
-
-
-
-
-
timeprof Public
timeprof is a simple C++ library for profiling code regions to measure execution time.
-
-