ggerganov


A cosy home for your LLMs.

Swift 689 26 Updated Dec 22, 2025

Audio playback and capture library written in C, in a single source file.

C 5,720 491 Updated Dec 25, 2025

Local LLM-assisted text completion for Qt Creator.

C++ 39 2 Updated Dec 15, 2025

Simple GUI around whisper.cpp for voice-to-text on Linux

Python 55 8 Updated Sep 9, 2025

Local LLM-assisted text completion for Qt Creator.

C++ 52 7 Updated Dec 15, 2025

MLPerf Client is a benchmark for Windows, Linux and macOS, focusing on client form factors in ML inference scenarios.

C++ 63 4 Updated Nov 17, 2025

Kernels & AI inference engine for mobile devices.

C++ 3,906 248 Updated Dec 23, 2025

Emacs package for LLM-assisted code/text completion

Emacs Lisp 33 2 Updated Nov 12, 2025

Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk

Python 1,910 157 Updated Dec 22, 2025

The application performs real-time inference on audio from an ALSA capture device

C++ 37 1 Updated Jun 19, 2025

TTS support with GGML

C++ 202 27 Updated Oct 5, 2025
Python 532 55 Updated Oct 1, 2025

LLM plugin for interacting with llama-server models

Python 29 5 Updated May 28, 2025

Running any GGUF SLMs/LLMs locally, on-device in Android

Kotlin 617 93 Updated Dec 25, 2025

DINOv2 inference engine written in C/C++ using ggml and OpenCV.

C++ 83 7 Updated May 6, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,876 1,816 Updated Oct 13, 2025

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,174 838 Updated May 12, 2025

📎 Clippy, now with some AI

TypeScript 1,142 61 Updated Nov 15, 2025

Command to convert from color text (ANSI or 256) to image.

Go 254 21 Updated Dec 18, 2025

Simple frontend for LLMs built in react-native.

TypeScript 1,978 149 Updated Dec 8, 2025

Speech-to-text transcription VST3/ARA plugin

TypeScript 51 5 Updated Jul 4, 2025
Python 16 1 Updated Jul 12, 2025
Pascal 16 3 Updated Jul 11, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 154,240 31,537 Updated Dec 24, 2025

Notes and exploration code for learning about AI/ML

C++ 202 23 Updated Dec 24, 2025

Go language bindings for the ggwave C++ library

Go 14 1 Updated Apr 9, 2025

Rounding vectors

C 10 Updated Mar 28, 2025

Easy to use interface for the Whisper model optimized for all GPUs!

C++ 405 22 Updated Aug 2, 2025

Run Orpheus 3B Locally With LM Studio

Python 503 109 Updated Mar 20, 2025