-
Open to new work opportunities.
- Chengdu
- [email protected]
- https://googs1025.github.io/
- @googs1025
- https://www.kubernetes.dev/community/awards/2025/#scheduling
Lists (11)
Sort Name ascending (A-Z)
Starred repositories
A CNI IPAM plugin that assigns IP addresses cluster-wide
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
A high-performance and light-weight router for vLLM large scale deployment
Unified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
verl: Volcano Engine Reinforcement Learning for LLMs
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
FlashInfer: Kernel Library for LLM Serving
A framework for efficient model inference with omni-modality models
Golang package for gossip based membership and failure detection
Rust implementation of proxychains — injectable library for chaining TCP/UDP connections via multiple proxies.
针对 atexec 进行了改进,增加了随机化和隐蔽性特征,以降低被检测的风险。
Run kubectl commands against multiple clusters at once
Variant optimization autoscaler for distributed inference workloads
Open Source Landscapes and Insights Produced by AntOSS
⚒️ AlphaTrion is an open-source framework to help build GenAI applications, including experiment tracking, adaptive model routing, prompt optimization and performance evaluation.
Kata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
Rust 实现的 BitTorrent DHT(BEP‑5)与爬虫,并内置元数据下载(BEP‑9/10)与 PeX(BEP‑11/ut_pex)。
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
This repository contains a reference implementation of NodeReadinessGates as a Kubernetes controller that manages node taints based on multiple readiness gate conditions, providing fine-grained con…
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
rusub 是一款高速、智能的跨平台子域枚举工具,支持启发式扫描、内置 10 万+ 词表、异步高并发、多格式输出及自动断点续传。