-
NECSTLab - Politecnico di Milano
- Milan, Italy
Highlights
- Pro
Lists (6)
Sort Name ascending (A-Z)
Stars
Graph database optimized for fast analysis and real-time data processing. It is provided as an extension to PostgreSQL.
A tool for bandwidth measurements on NVIDIA GPUs.
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
MSCCL++: A GPU-driven communication stack for scalable AI applications
Large Language Model (LLM) Systems Paper List
Instagram without all the toxic features like reels, home page, explore page. You can still view your friend's reels, stories, view profiles and text friends
NVIDIA Linux open GPU kernel module source
A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarking suites which are either insufficient or outdated.
Microbenchmark that unveals the mechanisms behind power readings reported by nvidia-smi on your NVIDIA GPU.
Benchmarking Deep Learning operations on different hardware
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.
The definitive Web UI for local AI, with powerful features and easy setup.
Easy Docker setup for Stable Diffusion with user-friendly UI
Hummingbird compiles trained ML models into tensor computation for faster inference.
Examples demonstrating available options to program multiple GPUs in a single node or a cluster
Collections of vector search related libraries, service and research papers
A curated list of awesome works related to high dimensional structure/vector search & database
📚 Awesome papers and technical blogs on vector DB (database), semantic-based vector search or approximate nearest neighbor search (ANN Search, ANNS). Vector search is the key component of large-sca…
NVIDIA curated collection of educational resources related to general purpose GPU programming.
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
VHDL implementation of a working-zone based encoding of addresses
Collection of small examples for running on ALCF resources
This package includes the implementation for four sparse linear algebra kernels: Sparse-Matrix-Vector-Multiplication (SpMV), Sparse-Triangular-Solve (SpTRSV), Sparse-Matrix-Transposition (SpTrans) …
📶 A curated list of awesome ESP8266/32 projects and code
Optimized primitives for collective multi-GPU communication
SYCL Academy, a set of learning materials for SYCL heterogeneous programming