Skip to content
View haolongzhangm's full-sized avatar

Block or report haolongzhangm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Android's adb standalone build with cmake, support Linux(x86-64、arm64), Windows(32bit) and macOS!

C++ 98 25 Updated Aug 29, 2024
Python 1,192 157 Updated Nov 24, 2025

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-to-end metrics across language, image, and video models.

Cuda 3,064 318 Updated Jan 17, 2026

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Go 7,530 942 Updated Jan 16, 2026

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 1,907 158 Updated Jan 6, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 22,219 3,517 Updated Jan 5, 2026

zlib replacement with optimizations for "next generation" systems.

C 1,921 310 Updated Jan 17, 2026

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

C++ 5,182 504 Updated Jan 18, 2026

Karabiner-Elements is a powerful tool for customizing keyboards on macOS

C++ 21,351 889 Updated Jan 19, 2026

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,263 400 Updated Jan 16, 2026

pocl - Portable Computing Language

C 1,046 280 Updated Jan 14, 2026

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 23,025 2,790 Updated Jun 12, 2025

LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via efficient conversion, runtime, and optimization

C++ 1,296 175 Updated Jan 19, 2026

Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖

Python 7,412 753 Updated Jan 19, 2026

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,825 1,107 Updated Jan 16, 2026

Towards Human-Sounding Speech

Python 5,891 506 Updated Dec 5, 2025

Spark-TTS Inference Code

Python 10,894 1,169 Updated Apr 9, 2025

A machine learning accelerator core designed for energy-efficient AI at the edge.

Emacs Lisp 1,996 223 Updated Jan 16, 2026

Userspace/GPU eBPF VM with llvm JIT/AOT compiler

C++ 124 14 Updated Nov 23, 2025
C++ 48 8 Updated Dec 16, 2025

Self-implemented NN operators for Qualcomm's Hexagon NPU

C 39 7 Updated Sep 30, 2025

On-device TTS model by Neuphonic

Python 4,555 485 Updated Jan 14, 2026
Vim Script 18 1 Updated Nov 6, 2025

Clash官网各版本Clash下载地址及备份下载地址

3,626 242 Updated Jan 1, 2026

Kernels & AI inference engine for mobile devices.

C++ 4,104 269 Updated Jan 18, 2026

MacOS Cross-Toolchain for Linux and *BSD

C++ 3,246 348 Updated Dec 15, 2025

Tools to set up a quick macOS VM in QEMU, accelerated by KVM.

Shell 13,889 1,143 Updated Apr 4, 2024

On-device AI across mobile, embedded and edge for PyTorch

Python 4,144 801 Updated Jan 19, 2026

A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

Python 2,112 89 Updated Dec 29, 2025

Fast Multimodal LLM on Mobile Devices

C++ 1,350 164 Updated Jan 17, 2026
Next