Skip to content
View dywsjtu's full-sized avatar
:shipit:
Focusing
:shipit:
Focusing

Highlights

  • Pro

Organizations

@SysML-Princeton

Block or report dywsjtu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

2,029 86 Updated Nov 27, 2025

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,118 319 Updated Oct 17, 2025

Safe Interactions with Foreign Languages through Omniglot

Rust 43 1 Updated Nov 9, 2025

Optimizing inference proxy for LLMs

Python 3,191 249 Updated Nov 20, 2025

llm theoretical performance analysis tools and support params, flops, memory and latency analysis.

Python 113 10 Updated Jul 11, 2025

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 724 102 Updated Nov 28, 2025

Perplexity GPU Kernels

C++ 532 72 Updated Nov 7, 2025

[ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention

Python 49 4 Updated Aug 6, 2025
Python 233 35 Updated Jun 25, 2025

Artifact for "Marconi: Prefix Caching for the Era of Hybrid LLMs" [MLSys '25 Outstanding Paper Award, Honorable Mention]

Python 46 3 Updated Mar 5, 2025

A fast communication-overlapping library for tensor/expert parallelism on GPUs.

C++ 1,181 85 Updated Aug 28, 2025

AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.

Python 275 85 Updated Aug 18, 2025

Paper list for Efficient Reasoning.

736 27 Updated Nov 20, 2025

[Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances

HTML 21 3 Updated Nov 15, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,560 713 Updated Nov 28, 2025

❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents

Python 306 25 Updated Sep 28, 2025
Python 113 5 Updated Sep 5, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,332 445 Updated Nov 28, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,561 2,916 Updated Nov 27, 2025

A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"

HTML 166 15 Updated Nov 21, 2025

A general and accurate MACs / FLOPs profiler for PyTorch models

Python 630 43 Updated Jul 29, 2025

Expressive, Easy to Build, and High-Performance Application Networks

Go 18 6 Updated Jul 1, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 8,764 1,006 Updated Nov 25, 2025

The calflops is designed to calculate FLOPs、MACs and Parameters in all various neural networks, such as Linear、 CNN、 RNN、 GCN、Transformer(Bert、LlaMA etc Large Language Model)

Python 899 37 Updated Jun 27, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,932 286 Updated May 15, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,465 3,545 Updated Nov 28, 2025

Curated collection of papers in machine learning systems

463 31 Updated Nov 15, 2025

A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems

Python 220 11 Updated Jul 24, 2025

Large Language Model (LLM) Systems Paper List

1,639 87 Updated Nov 27, 2025

Microsoft Azure Traces

Jupyter Notebook 1,032 171 Updated Oct 20, 2025
Next