Skip to content
View inisis's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report inisis

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework

C++ 1,673 199 Updated Dec 27, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 13,231 1,558 Updated Dec 17, 2025

A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.

C++ 18 2 Updated Dec 27, 2025

A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.

C++ 15 3 Updated Dec 23, 2025

A collection of practical, end-to-end AI application examples accelerated by MemryX hardware and software solutions. This repository offers examples for real-time video inference, object detection…

Python 3 Updated Dec 13, 2025

JAX bindings for Flash Attention v2

C++ 102 8 Updated Dec 29, 2025

Tokamax: A GPU and TPU kernel library.

Python 144 6 Updated Dec 27, 2025

Customizable Reinforcement Learning

Python 182 16 Updated Dec 28, 2025

A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks …

Python 1,745 228 Updated Jan 1, 2026

The repository provides code for running inference with the Meta Segment Anything Model 3 (SAM 3).

Python 40 6 Updated Dec 21, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

C++ 4,349 367 Updated Jan 1, 2026

SAM 3D Objects

Python 5,196 504 Updated Dec 31, 2025

Ultralytics YOLO 🚀

Python 50,574 9,763 Updated Jan 1, 2026

The best ChatGPT that $100 can buy.

Python 39,560 5,038 Updated Dec 31, 2025

A Toolkit to Help Optimize Onnx Model

Python 289 27 Updated Dec 31, 2025
1 Updated Aug 28, 2025

Multi-stream video inference with Ultralytics YOLO - Display multiple video streams in a grid layout with real-time object detection.

Python 12 1 Updated Dec 30, 2025

🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime

Python 107 33 Updated Dec 23, 2025

A high-performance tool for video upscaling, interpolation, depth estimation, and more. Available as a CLI and Adobe Extension.

Python 224 7 Updated Dec 31, 2025

🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization tools

Python 3,230 613 Updated Dec 19, 2025

Use safetensors with ONNX 🤗

Python 78 5 Updated Oct 1, 2025

mnn tts demo.

C++ 18 2 Updated May 7, 2025

mnn asr demo.

C++ 23 2 Updated Mar 24, 2025

llm deploy project based onnx.

C++ 48 8 Updated Oct 9, 2024

caffe model to onnx

Python 34 12 Updated Nov 16, 2022

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 11,168 1,175 Updated Mar 14, 2025

Verifile

Rust 1 Updated Aug 10, 2024

TypeScript 44 10 Updated Oct 12, 2025
Next