Stars
DeepEP: an efficient expert-parallel communication library
Perplexity open source garden for inference technology
A modular graph-based Retrieval-Augmented Generation (RAG) system
Curated collection of papers in machine learning systems
Large Language Model (LLM) Systems Paper List
Introduction to Machine Learning Systems
A curated list of awesome SmartNIC tutorials, papers, and projects.
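High-performance distributed C++ server framework: webserver, websocket server, custom tcp_server (includes a logging module, configuration module, thread module, coroutine module, coroutine scheduling module, IO coroutine scheduling module, hook module, socket module, bytearray serialization, http module, TcpServer module, Websocket module, Https module, etc., SMTP mail module, MySQL, SQLite3, ORM, Red…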
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
GoogleTest - Google Testing and Mocking Framework
A high performance and generic framework for distributed DNN training
AMD TCPDirect ultra low latency kernel bypass TCP and UDP implementation for AMD Solarflare network adapters, to be used with corresponding versions of Onload®️ at https://github.com/Xilinx-CNS/onl…
bytedance / ps-lite
Forked from dmlc/ps-lite. A lightweight parameter server interface
An open protocol enabling communication and interoperability between opaque agentic applications.
A Model Context Protocol (MCP) server implementation that provides network control and management capabilities through the ONOS SDN controller.
Mirror of the OpenDaylight controller gerrit project
A Model Context Protocol (MCP) server for the POX SDN controller
Market maker keeper for the Polymarket CLOB
Landing page for Software for Open Networking in the Cloud (SONiC) - https://sonic-net.github.io/SONiC/
Resource-adaptive cluster scheduler for deep learning training.
Tiresias is a GPU cluster manager for distributed deep learning training.