Starred repositories

NOFX: Defining the Next-Generation AI Trading Operating System. A multi-exchange AI trading platform (Binance/Hyperliquid/Aster) with multi-AI competition (deepseek/qwen/claude), self-evolution, and re…

Go 7,200 1,784 Updated Nov 9, 2025

OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Microsoft's Windows Recall. With OpenRecall, you can easily access your digital history, enhancing your me…

Python 2,517 152 Updated Sep 24, 2025
Jupyter Notebook 6 2 Updated May 24, 2017

Generate a timeline of your day, automatically

Swift 4,359 191 Updated Nov 9, 2025

Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…

JavaScript 7,388 745 Updated Sep 8, 2025

Selective Prompt Anchoring

Python 93 3 Updated Oct 21, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 194 13 Updated Jun 26, 2025

Sample cloud-first application with 10 microservices showcasing Kubernetes, Istio, and gRPC.

Go 19,300 9,104 Updated Oct 27, 2025

cluster data collected from production clusters in Alibaba for cluster management research

Jupyter Notebook 1,883 438 Updated Oct 13, 2025

Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.

Python 14,271 989 Updated Jul 31, 2025

APEX+ is an LLM Serving Simulator

Python 37 6 Updated Jun 16, 2025

Discovering Sparsity Allocation for Layer-wise Pruning of Large Language Models

Python 7 2 Updated Oct 29, 2024

Official PyTorch implementation of DLP: Dynamic Layerwise Pruning in Large Language Models (ICML'25)

Python 9 1 Updated Jun 4, 2025

A hybrid and high-performance layer-7 load balancing system.

C++ 10 1 Updated Nov 6, 2025

LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale

Python 148 29 Updated Jul 18, 2025

A language for constraint-guided and efficient LLM programming.

Python 4,078 214 Updated May 22, 2025

[AAAI 2025] Official Implementation of "Auto-Regressive Moving Diffusion Models for Time Series Forecasting"

Python 100 11 Updated Feb 9, 2025

One‑click codebase “blast” for Large‑Language‑Model workflows.

Vue 1,860 216 Updated Oct 20, 2025

P4runpro: Enabling Runtime Programmability for RMT Switches

P4 15 Updated Aug 26, 2024

RDMA example

C 226 77 Updated May 31, 2022

Artifact evaluation repo for EuroSys'24.

Python 28 2 Updated Nov 7, 2023
C++ 720 121 Updated Oct 29, 2025

Build resilient language agents as graphs.

Python 20,774 3,667 Updated Nov 9, 2025

CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.

Python 22 Updated Dec 11, 2024

Let's make video diffusion practical!

Python 16,134 1,550 Updated Oct 16, 2025

This is the repository for Direct Telemetry Access, a high-speed network telemetry collection system.

P4 26 1 Updated Apr 6, 2025

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 721 81 Updated Apr 6, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,377 3,996 Updated Nov 6, 2025