Starred repositories

DeepEP: an efficient expert-parallel communication library

Cuda 8,702 979 Updated Nov 6, 2025

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++ 913 80 Updated Nov 9, 2025

magic-trace collects and displays high-resolution traces of what a process is doing

OCaml 5,139 114 Updated Oct 25, 2025

"AI-Trader: Can AI Beat the Market?" Live Trading Bench: https://hkuds.github.io/AI-Trader/

Python 9,142 1,343 Updated Nov 8, 2025

Production-Grade Container Scheduling and Management

Go 118,484 41,679 Updated Nov 8, 2025

Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.

Python 34,981 4,590 Updated Nov 8, 2025

Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and improve overall performance.

Rust 10 2 Updated Oct 27, 2025

GPUd automates monitoring, diagnostics, and issue identification for GPUs

Go 449 55 Updated Nov 8, 2025

This is the official implementation for **"AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!"**.

Python 80 4 Updated Oct 16, 2025

The best ChatGPT that $100 can buy.

Python 36,174 4,231 Updated Nov 5, 2025

A transformer-based LLM, written completely in Rust.

Rust 2,949 243 Updated Oct 10, 2025

Burn is a next generation tensor library and Deep Learning Framework that doesn't compromise on flexibility, efficiency and portability.

Rust 13,377 731 Updated Nov 7, 2025

A workload for deploying LLM inference services on Kubernetes

Go 99 26 Updated Nov 7, 2025

DAOS Storage Stack (client libraries, storage engine, control plane)

C 892 333 Updated Nov 9, 2025

A modern high-performance open source message queuing system

C++ 3,036 155 Updated Nov 8, 2025

An exabyte-scale, multi-region distributed file system

C++ 1,211 73 Updated Nov 7, 2025

nanobind: tiny and efficient C++/Python bindings

C++ 3,121 257 Updated Nov 7, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 808 62 Updated Nov 4, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,998 1,664 Updated Nov 8, 2025

TypeScript 378 31 Updated Oct 16, 2025

Python quantitative trading strategies including VIX Calculator, Pattern Recognition, Commodity Trading Advisor, Monte Carlo, Options Straddle, Shooting Star, London Breakout, Heikin-Ashi, Pair Tra…

Python 8,546 1,601 Updated Apr 14, 2024

Free, open source crypto trading bot

Python 44,431 9,118 Updated Nov 9, 2025

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 33,523 5,165 Updated Nov 6, 2025

Offline optimization of your disaggregated Dynamo graph

Python 101 25 Updated Nov 9, 2025

An external log connector example for LMCache

Python 4 Updated Jun 13, 2025

A lightweight, powerful framework for multi-agent workflows

Python 17,194 2,833 Updated Nov 8, 2025

Renderer for the harmony response format to be used with gpt-oss

Rust 3,988 223 Updated Nov 5, 2025

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 912 44 Updated Oct 29, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,077 1,850 Updated Nov 9, 2025