- Chinese Academy of Sciences
- Beijing, China
- https://riverclouds.net/
Stars
Context parallel attention that accelerates DiT model inference with dynamic caching (https://wavespeed.ai/)
Tile-Based Runtime for Ultra-Low-Latency LLM Inference
Low-overhead tracing library and trace visualizer for pipelined CUDA kernels
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor… (a short usage sketch of this Python API follows the list)
Autonomous GPU Kernel Generation via Deep Agents
Draft-Target Disaggregation LLM Serving System via Parallel Speculative Decoding.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
verl: Volcano Engine Reinforcement Learning for LLMs
Read-only mirror of https://git.zx2c4.com/cgit/about . Pull requests and issues on GitHub cannot be accepted and will be automatically closed. The proper way to submit changes is via the mailing li…
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
macOS Adobe apps download & installer
jumploop / uv-custom
Forked from Wangnov/uv-custom. A mirror project kept in sync with the official astral-sh/uv releases, aiming to give users in mainland China a faster and more stable experience installing and using uv.
Artifact from "Hardware Compute Partitioning on NVIDIA GPUs". THIS IS A FORK OF BAKITA'S REPO. I AM NOT ONE OF THE AUTHORS OF THE PAPER.
ColorBrewer color schemes for gnuplot
Venus Collective Communication Library, supported by SII and Infrawaves.
Chat log tool that makes it easy to use your own chat data.
Open-source observability for your GenAI or LLM application, based on OpenTelemetry
Empowering LLM Agents for Real-World Computer System Optimization
AgentNetworkProtocol (ANP) is an open-source protocol for agent communication. Our vision is to define how agents connect with each other, building an open, secure, and efficient collaboration netwo…
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
NVIDIA NVSHMEM is a parallel programming interface for NVIDIA GPUs based on OpenSHMEM. NVSHMEM can significantly reduce multi-process communication and coordination overheads by allowing programmer…
Production-grade client-side tracing, profiling, and analysis for complex software systems.
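
For the TensorRT-LLM entry above: a minimal sketch of its high-level Python LLM API as advertised in the project's quick-start material; the model id and sampling settings are placeholder assumptions, not details taken from the starred description.

    from tensorrt_llm import LLM, SamplingParams

    # Placeholder model id (assumption); any Hugging Face checkpoint supported by TensorRT-LLM works.
    llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")

    prompts = ["Explain speculative decoding in one sentence."]
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

    # generate() builds or loads the TensorRT engine and runs batched inference on the GPU.
    for output in llm.generate(prompts, sampling_params):
        print(output.outputs[0].text)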