Lists (1)
Sort Name ascending (A-Z)
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide
Cloud native networking and network security
A PyTorch native platform for training generative AI models
CUDA Python: Performance meets Productivity
A next generation Python CMake adaptor and Python API for plugins
NVIDIA curated collection of educational resources related to general purpose GPU programming.
A Datacenter Scale Distributed Inference Serving Framework
A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
making the official triton tutorials actually comprehensible
Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.
Fully open reproduction of DeepSeek-R1
Open-source search and retrieval database for AI applications.
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Distributed Task Queue (development branch)
Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.
a language for fast, portable data-parallel computation
A high-throughput and memory-efficient inference and serving engine for LLMs
Ghidra is a software reverse engineering (SRE) framework
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Protocol Buffers - Google's data interchange format
tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)
A lightweight LLVM python binding for writing JIT compilers
🦜🔗 Build context-aware reasoning applications