Stars
Frida scripts to rewrite mobile applications at runtime to directly MitM all HTTPS traffic
Electron wrapper to build and distribute HTTP Toolkit for the desktop
HTTP Toolkit is a beautiful & open-source tool for debugging, testing and building with HTTP(S) on Windows, Linux & Mac 🎉 Open an issue here to give feedback or ask for help.
Real(ish)-time local chat: STT -> LLM -> TTS
A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment…
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
🏡 Open source home automation that puts local control and privacy first.
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
SGLang is a fast serving framework for large language models and vision language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Large-scale LLM inference engine
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
A fast inference library for running LLMs locally on modern consumer-class GPUs
Universal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.
Lightweight coding agent that runs in your terminal
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
aider is AI pair programming in your terminal
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
The official API server for Exllama. OAI compatible, lightweight, and fast.
A Conversational Speech Generation Model
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…