zhaohb

Follow

zhaohongbo zhaohb

Follow

25 followers · 56 following

Intel
shanghai

Achievements

Achievements

Starred repositories

mit-han-lab / x-attention

[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring

Python 242 14 Updated Jul 6, 2025

ROCm / rocSHMEM

rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.

C++ 121 35 Updated Oct 25, 2025

mashriram / jet_nemotron_lib

Python 9 1 Updated Aug 28, 2025

ROCm / rocFFT

[DEPRECATED] Moved to ROCm/rocm-libraries repo

C++ 198 95 Updated Oct 24, 2025

fastrepl / hyprnote

Local-first AI Notepad for Private Meetings

TypeScript 6,417 396 Updated Oct 26, 2025

mediar-ai / screenpipe

AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording

TypeScript 15,820 1,235 Updated Sep 1, 2025

mcp-use / mcp-use

mcp-use is the easiest way to interact with mcp servers with custom agents

TypeScript 8,063 943 Updated Oct 26, 2025

openvinotoolkit / npu_compiler

OpenVINO Intel NPU Compiler

MLIR 73 31 Updated Oct 20, 2025

usstq / aboutSHW

mysteries about Hardware in software engineer's eyes

C++ 5 9 Updated Oct 21, 2025

openai / openai-realtime-agents

This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.

TypeScript 6,553 1,011 Updated Oct 8, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,129 1,656 Updated Sep 24, 2025

zhaohb / MeloTTS-OV

Using OpenVINO to speed up MeloTTS inference

Python 13 3 Updated Nov 1, 2024

zhaohb / InternVL2-4B-OV

Python 3 Updated Oct 16, 2024

zhaohb / moondream2-ov

Using OpenVINO to speed up moondream2 inference

Python 4 Updated Oct 14, 2024

zhaohb / fastapi_tritonserver

Python 27 6 Updated Nov 6, 2024

zhaohb / TTS-OV

Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 2 2 Updated Aug 6, 2024

huggingface / optimum-intel

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook 503 149 Updated Oct 24, 2025

yuanjiechen / trt_final

C++ 7 2 Updated Sep 22, 2023

Tlntin / Qwen-TensorRT-LLM

Python 625 57 Updated Jul 31, 2024

Tlntin / ChatGLM2-6B-TensorRT

Python 90 10 Updated Jun 30, 2023

tw93 / Pake

🤱🏻 Turn any webpage into a desktop app with one command. 一键打包网页生成轻量桌面应用

Rust 43,120 8,236 Updated Oct 23, 2025

wangzhaode / mnn-llm

llm deploy project based mnn. This project has merged into MNN.

C++ 1,606 176 Updated Jan 20, 2025

triton-inference-server / python_backend

Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.

C++ 649 183 Updated Oct 15, 2025

NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 12,272 2,258 Updated Sep 24, 2025

tlc-pack / relax

Python 193 56 Updated Mar 28, 2023

alibaba / BladeDISC

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

C++ 902 170 Updated Dec 30, 2024

openvinotoolkit / openvino

OpenVINO™ is an open source toolkit for optimizing and deploying AI inference

C++ 9,104 2,770 Updated Oct 26, 2025

flyteorg / flyte

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 6,551 753 Updated Oct 25, 2025

OAID / AutoKernel

AutoKernel 是一个简单易用，低门槛的自动算子优化工具，提高深度学习算法部署效率。

C++ 742 81 Updated Sep 23, 2022

triton-inference-server / tensorrt_backend

The Triton backend for TensorRT.

C++ 79 33 Updated Oct 10, 2025

Starred topics

ctr-prediction