Skip to content
View zhaohb's full-sized avatar

Block or report zhaohb

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 242 14 Updated Jul 6, 2025

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 121 35 Updated Oct 25, 2025
Python 9 1 Updated Aug 28, 2025

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 198 95 Updated Oct 24, 2025

Local-first AI Notepad for Private Meetings

TypeScript 6,417 396 Updated Oct 26, 2025

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

TypeScript 15,820 1,235 Updated Sep 1, 2025

mcp-use is the easiest way to interact with mcp servers with custom agents

TypeScript 8,063 943 Updated Oct 26, 2025

OpenVINO Intel NPU Compiler

MLIR 73 31 Updated Oct 20, 2025

mysteries about Hardware in software engineer's eyes

C++ 5 9 Updated Oct 21, 2025

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 6,553 1,011 Updated Oct 8, 2025

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,129 1,656 Updated Sep 24, 2025

Using OpenVINO to speed up MeloTTS inference

Python 13 3 Updated Nov 1, 2024
Python 3 Updated Oct 16, 2024

Using OpenVINO to speed up moondream2 inference

Python 4 Updated Oct 14, 2024

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 2 2 Updated Aug 6, 2024

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook 503 149 Updated Oct 24, 2025
C++ 7 2 Updated Sep 22, 2023
Python 625 57 Updated Jul 31, 2024
Python 90 10 Updated Jun 30, 2023

🤱🏻 Turn any webpage into a desktop app with one command. 一键打包网页生成轻量桌面应用

Rust 43,120 8,236 Updated Oct 23, 2025

llm deploy project based mnn. This project has merged into MNN.

C++ 1,606 176 Updated Jan 20, 2025

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

C++ 649 183 Updated Oct 15, 2025

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,272 2,258 Updated Sep 24, 2025
Python 193 56 Updated Mar 28, 2023

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 902 170 Updated Dec 30, 2024

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 9,104 2,770 Updated Oct 26, 2025

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 6,551 753 Updated Oct 25, 2025

AutoKernel 是一个简单易用,低门槛的自动算子优化工具,提高深度学习算法部署效率。

C++ 742 81 Updated Sep 23, 2022

The Triton backend for TensorRT.

C++ 79 33 Updated Oct 10, 2025
Next