dywsjtu

Focusing

Yinwei Dai dywsjtu

Focusing

CS Ph.D. student @princeton Previously studied electrical and computer engineering @sjtu and computer science @umich

81 followers · 19 following

Princeton University
Princeton, NJ
23:28 (UTC -05:00)
https://yinwei-dai.com
@dai_yinwei

Achievements

Highlights

Organizations

Stars

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,029 86 Updated Nov 27, 2025

GT-RIPL / Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,118 319 Updated Oct 17, 2025

omniglot-rs / omniglot

Safe Interactions with Foreign Languages through Omniglot

Rust 43 1 Updated Nov 9, 2025

algorithmicsuperintelligence / optillm

Optimizing inference proxy for LLMs

Python 3,191 249 Updated Nov 20, 2025

harleyszhang / llm_counts

llm theoretical performance analysis tools and support params, flops, memory and latency analysis.

Python 113 10 Updated Jul 11, 2025

vllm-project / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 724 102 Updated Nov 28, 2025

perplexityai / pplx-kernels

Perplexity GPU Kernels

C++ 532 72 Updated Nov 7, 2025

DerrickYLJ / TidalDecode

[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Python 49 4 Updated Aug 6, 2025

pyember / ember

Python 233 35 Updated Jun 25, 2025

ruipeterpan / marconi

Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]

Python 46 3 Updated Mar 5, 2025

bytedance / flux

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,181 85 Updated Aug 28, 2025

bytedance / ByteMLPerf

AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 275 85 Updated Aug 18, 2025

hemingkx / Awesome-Efficient-Reasoning

Paper list for Efficient Reasoning.

736 27 Updated Nov 20, 2025

youngsoul0731 / FLORA-Bench

[Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances

HTML 21 3 Updated Nov 15, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,560 713 Updated Nov 28, 2025

Just-Curieous / Curie

❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents

Python 306 25 Updated Sep 28, 2025

cornstarch-org / Cornstarch

Python 113 5 Updated Sep 5, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,332 445 Updated Nov 28, 2025

openai / openai-agents-python

A lightweight, powerful framework for multi-agent workflows

Python 17,561 2,916 Updated Nov 27, 2025

junchenzhi / Awesome-LLM-Ensemble

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

HTML 166 15 Updated Nov 21, 2025

zhijian-liu / torchprofile

A general and accurate MACs / FLOPs profiler for PyTorch models

Python 630 43 Updated Jul 29, 2025

appnet-org / appnet

Expressive, Easy to Build, and High-Performance Application Networks

Go 18 6 Updated Jul 1, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,764 1,006 Updated Nov 25, 2025

MrYxJ / calculate-flops.pytorch

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 899 37 Updated Jun 27, 2024

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,932 286 Updated May 15, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 20,465 3,545 Updated Nov 28, 2025

byungsoo-oh / ml-systems-papers

Curated collection of papers in machine learning systems

463 31 Updated Nov 15, 2025

HPMLL / BurstGPT

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 220 11 Updated Jul 24, 2025

AmberLJC / LLMSys-PaperList

Large Language Model (LLM) Systems Paper List

1,639 87 Updated Nov 27, 2025

Azure / AzurePublicDataset

Microsoft Azure Traces

Jupyter Notebook 1,032 171 Updated Oct 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Yinwei Dai dywsjtu

Achievements

Achievements

Highlights

Organizations

Block or report dywsjtu

Stars

jonyzhang2023 / awesome-embodied-vla-va-vln

GT-RIPL / Awesome-LLM-Robotics

omniglot-rs / omniglot

algorithmicsuperintelligence / optillm

harleyszhang / llm_counts

vllm-project / guidellm

perplexityai / pplx-kernels

DerrickYLJ / TidalDecode

pyember / ember

ruipeterpan / marconi

bytedance / flux

bytedance / ByteMLPerf

hemingkx / Awesome-Efficient-Reasoning

youngsoul0731 / FLORA-Bench

ai-dynamo / dynamo

Just-Curieous / Curie

cornstarch-org / Cornstarch

kvcache-ai / Mooncake

openai / openai-agents-python

junchenzhi / Awesome-LLM-Ensemble

zhijian-liu / torchprofile

appnet-org / appnet

deepseek-ai / DeepEP

MrYxJ / calculate-flops.pytorch

deepseek-ai / open-infra-index

sgl-project / sglang

byungsoo-oh / ml-systems-papers

HPMLL / BurstGPT

AmberLJC / LLMSys-PaperList

Azure / AzurePublicDataset