Skip to content
View jiasheng55's full-sized avatar
😀
follow my heart
😀
follow my heart

Block or report jiasheng55

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 2,900 540 Updated Nov 7, 2025

Sandbox implemented in GO including containers (namespace, cgroup), ptrace, seccomp

Go 236 36 Updated Oct 11, 2025

A tool for bandwidth measurements on NVIDIA GPUs.

C++ 564 63 Updated Apr 15, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,415 2,176 Updated Sep 5, 2025

NCCL Tests

Cuda 1,328 327 Updated Nov 3, 2025

Benchmarking guide for the Azure AI Infrastructure.

Python 37 12 Updated Oct 30, 2025

Recipes for reproducing training and serving benchmarks for large machine learning models using GPUs on Google Cloud.

Python 96 44 Updated Nov 5, 2025

GPU & cluster health and performance monitoring solution for OCI

HCL 9 2 Updated Nov 8, 2025
Shell 2 1 Updated Jan 25, 2025

AI 基础知识 - GPU 架构、CUDA 编程、大模型基础及AI Agent 相关知识

HTML 571 89 Updated Nov 10, 2025

GPU documentation for humans

Python 391 47 Updated Oct 3, 2025

ComScribe is a tool to identify communication among all GPU-GPU and CPU-GPU pairs in a single-node multi-GPU system.

C++ 27 4 Updated Jul 6, 2023

CP-Bench is a PyTorch testing/benchmarking suite to detect AI hardware issues, such as functional reliability, silent data corruption, and performance anomalies

Python 4 1 Updated Nov 7, 2025

Azure HPC/AI VM Images

Shell 121 95 Updated Nov 5, 2025

The Cloud Native Control Plane

Go 11,024 1,099 Updated Nov 6, 2025

🐊 Policy Controller for Kubernetes

Go 4,045 826 Updated Nov 10, 2025

从零实现一个 llama3 中文版

Jupyter Notebook 984 97 Updated Jun 12, 2024

从零实现一个小参数量中文大语言模型。

Python 871 100 Updated Aug 22, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 10,789 1,098 Updated Apr 30, 2025

Automatically cordon and drain Kubernetes nodes based on node conditions

Go 669 88 Updated Mar 26, 2024

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,442 676 Updated Nov 10, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,929 286 Updated May 15, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,710 981 Updated Nov 6, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

67,104 7,582 Updated Jun 4, 2025

📚 从零开始的大语言模型原理与实践教程

Jupyter Notebook 21,195 1,874 Updated Nov 7, 2025

Python tool for converting files and office documents to Markdown.

Python 82,789 4,686 Updated Oct 20, 2025

Instant Kubernetes-Native Application Observability

C++ 6,226 478 Updated Oct 28, 2025

Build and run containers leveraging NVIDIA GPUs

Go 3,816 430 Updated Nov 10, 2025

a unified scheduler for online and offline tasks

Go 628 84 Updated Mar 26, 2025

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 1,606 68 Updated Jun 5, 2025
Next