Skip to content
View beoy's full-sized avatar

Block or report beoy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.

Python 709 92 Updated Jan 13, 2026

A framework for efficient model inference with omni-modality models

Python 2,097 279 Updated Jan 13, 2026

Fast low-bit matmul kernels in Triton

Python 420 31 Updated Dec 18, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 82,839 12,452 Updated Jan 10, 2026

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

10,578 727 Updated Jan 7, 2026

Cloud native networking and network security

Go 6,996 1,527 Updated Jan 13, 2026

Tools for building GPU clusters

Shell 1,408 350 Updated Jan 9, 2026

A PyTorch native platform for training generative AI models

Python 4,954 663 Updated Jan 13, 2026

CUDA Python: Performance meets Productivity

Cython 3,129 236 Updated Jan 12, 2026

A next generation Python CMake adaptor and Python API for plugins

Python 430 80 Updated Jan 9, 2026

NVIDIA curated collection of educational resources related to general purpose GPU programming.

Jupyter Notebook 1,076 191 Updated Jan 12, 2026

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,767 776 Updated Jan 13, 2026

A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS

247 12 Updated May 6, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 99,983 11,339 Updated Jan 13, 2026

making the official triton tutorials actually comprehensible

Python 94 23 Updated Aug 25, 2025

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,009 103 Updated Jul 29, 2024

Fully open reproduction of DeepSeek-R1

Python 25,812 2,410 Updated Nov 24, 2025

Open-source search and retrieval database for AI applications.

Rust 25,430 1,996 Updated Jan 13, 2026

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

Python 28,148 4,539 Updated Jan 4, 2026

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,092 1,619 Updated Jan 13, 2026

PyZMQ: Python bindings for zeromq

Python 4,070 656 Updated Jan 5, 2026

Distributed Task Queue (development branch)

Python 27,861 4,929 Updated Jan 10, 2026

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 159,309 14,140 Updated Jan 13, 2026

a language for fast, portable data-parallel computation

C++ 6,516 1,096 Updated Dec 28, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,369 12,549 Updated Jan 13, 2026

Ghidra is a software reverse engineering (SRE) framework

Java 63,474 7,051 Updated Jan 8, 2026

TensorFlow/TensorRT integration

Jupyter Notebook 743 224 Updated Nov 30, 2023

Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Python 15,254 1,705 Updated Jun 25, 2025

Protocol Buffers - Google's data interchange format

C++ 70,294 16,000 Updated Jan 13, 2026
Next