Skip to content
View juanmc2005's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report juanmc2005

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

State-of-the-art TTS model under 25MB 😻

Python 9,160 464 Updated Aug 23, 2025

pyannoteAI Python SDK

Python 11 2 Updated Oct 10, 2025

OctoTools: An agentic framework with extensible tools for complex reasoning

Python 1,390 181 Updated Oct 11, 2025

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 156,682 13,764 Updated Nov 27, 2025

An android application that let you track your expenses

Java 624 143 Updated Aug 7, 2024

HTTP load generator, ApacheBench (ab) replacement

Go 19,531 1,273 Updated Aug 20, 2024

Minimalist ML framework for Rust

Rust 18,679 1,316 Updated Nov 25, 2025

On-device Speech Recognition for Apple Silicon

Swift 5,234 475 Updated Nov 26, 2025

Fast and memory-efficient exact attention

Python 20,778 2,170 Updated Nov 25, 2025

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

704 44 Updated Oct 16, 2025

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming…

Python 157 10 Updated Nov 17, 2025

LLM Chain querying a scientific Zotero library, with citations

Python 439 8 Updated Aug 4, 2023

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,598 232 Updated Oct 30, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 64,124 11,587 Updated Nov 28, 2025

Foundation Architecture for (M)LLMs

Python 3,121 222 Updated Apr 11, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,986 347 Updated Jan 8, 2025

The first real AI developer

Python 33,667 3,484 Updated Nov 10, 2025

Cross-Platform, GPU Accelerated Whisper 🏎️

TypeScript 1,805 83 Updated Feb 27, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,731 619 Updated Feb 21, 2025

A natural language interface for computers

Python 60,911 5,229 Updated Nov 26, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,714 1,166 Updated Nov 14, 2024

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 91 8 Updated Oct 18, 2023

Faster Whisper transcription with CTranslate2

Python 19,248 1,598 Updated Nov 19, 2025

Port of OpenAI's Whisper model in C/C++

C++ 44,756 4,966 Updated Nov 20, 2025

πŸΈπŸ’¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,625 5,802 Updated Aug 16, 2024

The Time Series Visualization Tool that you deserve.

C++ 5,500 730 Updated Nov 27, 2025

Tensor library for machine learning

C++ 13,630 1,411 Updated Nov 24, 2025

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 22,015 2,454 Updated Oct 2, 2025
Python 2,163 367 Updated Sep 6, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 38,176 4,137 Updated Nov 24, 2025
Next