Starred repositories

Nano vLLM

Python 9,053 1,095 Updated Nov 3, 2025

Cloud Native Policy Management

Go 7,096 1,147 Updated Nov 18, 2025

Jieba Chinese word segmentation

Python 34,561 6,735 Updated Aug 21, 2024

Mainly records knowledge and interview questions for large language model (LLM) algorithm (application) engineers

HTML 10,910 1,113 Updated Apr 30, 2025

Leetcode for Pytorch

Jupyter Notebook 1,669 191 Updated Jul 26, 2025

Machine Learning Engineering Open Book

Python 15,773 968 Updated Oct 27, 2025

dev tools, env vars, task runner

Rust 21,465 739 Updated Nov 18, 2025

AuctionGym is a simulation environment that enables reproducible evaluation of bandit and reinforcement learning methods for online advertising auctions.

Jupyter Notebook 181 46 Updated Jun 18, 2025

A universal scalable machine learning model deployment solution

Java 240 82 Updated Nov 18, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 33,995 3,252 Updated Nov 18, 2025

LLM inference in C/C++

C++ 90,018 13,738 Updated Nov 18, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 373 24 Updated Jul 8, 2025

The Finch CLI is an open source client for container development

Go 3,921 108 Updated Nov 18, 2025

An intuitive, lightweight web framework in C for building modern web applications

C 954 45 Updated Oct 21, 2025

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 10,666 1,188 Updated Nov 17, 2025

Free IDE for Kubernetes

TypeScript 3,797 134 Updated Nov 18, 2025

Define Kubernetes native apps and abstractions using object-oriented programming

JavaScript 4,720 306 Updated Nov 18, 2025

Parallel S3 and local filesystem execution tool.

Go 3,739 315 Updated Jun 13, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,385 99 Updated Nov 16, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,501 837 Updated Nov 6, 2025

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

C++ 655 186 Updated Nov 14, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,895 745 Updated Nov 14, 2025

Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)

Go 497 228 Updated Nov 18, 2025

Work with remote image registries: retrieving information, images, signing content

Go 10,028 881 Updated Nov 14, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,619 1,861 Updated Nov 17, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,660 306 Updated Oct 20, 2025

Amazon Nova Act is a research preview of a new AI model for developers to build agents that take actions in web browsers

Python 837 137 Updated Nov 18, 2025

Open standard for machine learning interoperability

Python 19,897 3,829 Updated Nov 7, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 25,175 4,700 Updated Oct 9, 2025