Lists (1)
Sort Name ascending (A-Z)
Stars
A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.
A framework for efficient model inference with omni-modality models
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide
Cloud native networking and network security
A PyTorch native platform for training generative AI models
CUDA Python: Performance meets Productivity
A next generation Python CMake adaptor and Python API for plugins
NVIDIA curated collection of educational resources related to general purpose GPU programming.
A Datacenter Scale Distributed Inference Serving Framework
A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
making the official triton tutorials actually comprehensible
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Fully open reproduction of DeepSeek-R1
Open-source search and retrieval database for AI applications.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Distributed Task Queue (development branch)
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
a language for fast, portable data-parallel computation
A high-throughput and memory-efficient inference and serving engine for LLMs
Ghidra is a software reverse engineering (SRE) framework
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Protocol Buffers - Google's data interchange format