Skip to content
View ztiy's full-sized avatar

Block or report ztiy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 2,195 366 Updated Aug 14, 2025

Optimized primitives for collective multi-GPU communication

C++ 4,267 1,074 Updated Nov 10, 2025

"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

Go 53,719 4,778 Updated Nov 24, 2025

Nano vLLM

Python 9,263 1,134 Updated Nov 3, 2025

Cloud Native Policy Management

Go 7,119 1,150 Updated Nov 26, 2025

结巴中文分词

Python 34,593 6,737 Updated Aug 21, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 11,022 1,118 Updated Apr 30, 2025

Leetcode for Pytorch

Jupyter Notebook 1,683 192 Updated Jul 26, 2025

Machine Learning Engineering Open Book

Python 15,846 977 Updated Nov 21, 2025

dev tools, env vars, task runner

Rust 21,681 748 Updated Nov 26, 2025

AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.

Jupyter Notebook 181 46 Updated Jun 18, 2025

A universal scalable machine learning model deployment solution

Java 241 82 Updated Nov 25, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,103 3,267 Updated Nov 26, 2025

LLM inference in C/C++

C++ 90,416 13,829 Updated Nov 26, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 374 24 Updated Jul 8, 2025

The Finch CLI is an open source client for container development

Go 3,925 108 Updated Nov 25, 2025

A intuitive, lightweight web framework in C for building modern web applications

C 958 45 Updated Oct 21, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,735 1,192 Updated Nov 24, 2025

Free IDE for Kubernetes

TypeScript 3,874 139 Updated Nov 25, 2025

Define Kubernetes native apps and abstractions using object-oriented programming

JavaScript 4,724 306 Updated Nov 25, 2025

Parallel S3 and local filesystem execution tool.

Go 3,750 318 Updated Jun 13, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,392 102 Updated Nov 19, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,606 845 Updated Nov 6, 2025

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

C++ 655 186 Updated Nov 25, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,912 752 Updated Nov 25, 2025

Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)

Go 498 228 Updated Nov 24, 2025

Work with remote images registries - retrieving information, images, signing content

Go 10,067 885 Updated Nov 25, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,646 1,863 Updated Nov 25, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,676 306 Updated Oct 20, 2025
Next