Skip to content
View hbredin's full-sized avatar

Highlights

  • Pro

Organizations

@tvd-dataset @camomile-project @pyannote

Block or report hbredin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

ElevenLabs UI is a component library and custom registry built on top of shadcn/ui to help you build multimodal agents faster.

TypeScript 1,380 100 Updated Nov 3, 2025

Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.

Python 950 69 Updated Nov 10, 2025

pyannoteAI Python SDK

Python 10 1 Updated Oct 10, 2025

Mamba SSM architecture

Python 16,379 1,484 Updated Oct 10, 2025

Official Repository For VoxBlink2

Python 84 6 Updated Aug 13, 2024

Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.

Svelte 68 9 Updated Oct 21, 2025

Package to play with active learning-like subset selection in pyannote.

Jupyter Notebook 1 1 Updated Sep 20, 2024

Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024

HTML 3 Updated Jul 16, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,804 247 Updated Aug 1, 2024

Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automatic speech recognition

Python 33 4 Updated Jun 14, 2024

Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings" published at Odyssey 2024

Python 97 4 Updated Jan 10, 2025

plotting on terminal

Python 2,033 91 Updated Sep 24, 2024

Speech-to-text in Obsidian using OpenAI Whisper

TypeScript 312 55 Updated Mar 2, 2024

Performance-portable, length-agnostic SIMD with runtime dispatch

C++ 5,127 388 Updated Nov 5, 2025
Python 65 3 Updated Feb 8, 2024

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,977 345 Updated Jan 8, 2025

Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.

Python 125 12 Updated Sep 25, 2023

C++ fast hierarchical clustering algorithms

C++ 89 18 Updated Jun 13, 2023

Official implementation of "Separate Anything You Describe"

Python 1,834 138 Updated Nov 26, 2024

Cross-Platform, GPU Accelerated Whisper 🏎️

TypeScript 1,805 83 Updated Feb 27, 2024

Voice Conversion With Just Nearest Neighbors

Python 501 70 Updated Mar 18, 2024

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Shell 58 2 Updated Sep 19, 2024

Track and predict the energy consumption and carbon footprint of training deep learning models.

Python 465 36 Updated Sep 24, 2025

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 91 8 Updated Oct 18, 2023

MeetEval - A meeting transcription evaluation toolkit

Python 116 15 Updated Oct 2, 2025
JavaScript 113 33 Updated Jan 8, 2023

A custom micropython firmware integrating tensorflow lite for microcontrollers and ulab to implement the tensorflow micro examples.

C 191 94 Updated Feb 18, 2025

Behavioral probing of language acquisition models at the lexical and syntactic level

Python 17 1 Updated Jul 17, 2023
Next