CS PhD student at UC Berkeley, building @vllm-project
- University of California, Berkeley
- Berkeley, CA
- https://woosuk.me
- @woosuk_k
Highlights
- Pro
- flashinfer Public
  Forked from flashinfer-ai/flashinfer
  FlashInfer: Kernel Library for LLM Serving
  Cuda, Apache License 2.0, Updated Sep 11, 2025
- vllm Public
  Forked from vllm-project/vllm
  A high-throughput and memory-efficient inference and serving engine for LLMs
- dynamo Public
  Forked from ai-dynamo/dynamo
  A Datacenter Scale Distributed Inference Serving Framework
  Rust, Apache License 2.0, Updated May 17, 2025
- subpop Public
  Forked from JosephJeesungSuh/subpop
  Official repository for Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions
  Python, BSD 3-Clause "New" or "Revised" License, Updated Feb 25, 2025
- torch-xla Public
  Forked from pytorch/xla
  Enabling PyTorch on XLA Devices (e.g. Google TPU)
- retraining-free-pruning Public
  [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers