KarmaD7

💤

Shaofeng Ding KarmaD7

💤

Ph.D. Student, Tsinghua MADSys Group. Distributed System & AI Infra.

86 followers · 80 following

Tsinghua University
Beijing, China

Achievements

Highlights

Starred repositories

simongog / sdsl-lite

Succinct Data Structure Library 2.0

C++ 2,285 352 Updated Jun 2, 2023

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,934 696 Updated Nov 7, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 20,075 3,317 Updated Nov 10, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,245 421 Updated Nov 9, 2025

CMU-SAFARI / Virtuoso

Virtuoso is a fast, accurate and versatile simulation framework designed for virtual memory research. Virtuoso uses a new simulation methodology for estimating OS overheads and models diverse VM de…

C++ 75 14 Updated Oct 15, 2025

Zippland / worth-calculator

Calculating the actual value of your job beyond just salary

TypeScript 2,805 170 Updated Oct 14, 2025

deepseek-ai / 3FS

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,451 958 Updated Oct 24, 2025

deepseek-ai / DualPipe

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,874 305 Updated Mar 10, 2025

deepseek-ai / EPLB

Expert Parallelism Load Balancer

Python 1,291 195 Updated Mar 24, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,870 739 Updated Oct 15, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,706 979 Updated Nov 6, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 286 Updated May 15, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,855 897 Updated Sep 30, 2025

c7w / papersgpt-for-zotero-sidebar

JavaScript 67 2 Updated Jun 30, 2025

DawnCarol / DawnCarol

209 28 Updated Apr 13, 2025

stephentu / silo

Multicore in-memory storage engine

C++ 395 119 Updated Oct 10, 2017

leverimmy / THU-Annual-Eat

一年过去了，你在华子食堂里花的钱都花在哪儿了？

Python 469 78 Updated Dec 23, 2024

ChenDolph7in / TraceGuard

Jupyter Notebook 6 Updated Dec 17, 2024

cmuparlay / parlaylib

A Toolkit for Programming Parallel Algorithms on Shared-Memory Multicore Machines

C++ 394 75 Updated Sep 18, 2025

opensmartnic / awesome-smartnic

A curated list of awesome smartnic tutorials, papers and projects.

282 37 Updated Oct 27, 2025

smartnickit-project / smartnic-bench

A rust-based benchmark for BlueField SmartNICs.

Rust 30 4 Updated Jul 5, 2023

dmemsys / awesome-disaggregated-memory

A collection of awesome researchers and papers about disaggregated memory.

173 14 Updated Oct 14, 2025

redn-io / RedN

Arbitrary offloads for RDMA NICs

C 98 21 Updated Apr 25, 2022

apache / brpc

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 17,361 4,068 Updated Nov 9, 2025