Highlights
- Pro
Stars
Syncthing-Fork - A Syncthing Wrapper for Android.
Provides compile-time contraction pattern analysis to determine optimal tensor operation to perform.
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python ⚡
cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
Tool for generating Clang's JSON Compilation Database files for make-based build systems.
Efficient Distributed GPU Programming for Exascale, an SC/ISC Tutorial
Home for cuQuantum Python & NVIDIA cuQuantum SDK C++ samples
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
CUDA Python: Performance meets Productivity
Optimized primitives for collective multi-GPU communication
C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows
A Python framework for accelerated simulation, data generation and spatial computing.
NVIDIA curated collection of educational resources related to general purpose GPU programming.
An transformer based LLM. Written completely in Rust
GPU programming related news and material links
nanobind: tiny and efficient C++/Python bindings
Open-Source Quantum Chemistry – an electronic structure package in C++ driven by Python
A Julia Basket of Hand-Picked Krylov Methods
A cheatsheet of modern C++ language and library features.