Skip to content
View tscheepers's full-sized avatar

Organizations

@apple

Block or report tscheepers

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these factors with real speech and noise datasets.

Python 74 5 Updated Sep 29, 2025

Analyzing and Improving Speaker Similarity Assessment in Speech Synthesis

Python 11 3 Updated Jul 21, 2025

Speech Human Evaluation Estimation Toolkit (SHEET)

Python 129 10 Updated Oct 2, 2025

UTokyo-SaruLab MOS Prediction System

Python 285 28 Updated Dec 18, 2025

Evaluation code for the Interspeech publication "Towards Frame-level Quality Predictions of Synthetic Speech". Evaluate frame-level representations of MOS predictors.

Python 13 1 Updated Aug 15, 2025

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 413 35 Updated Feb 21, 2024

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 19,037 1,663 Updated Nov 19, 2025

A python package to build AI-powered real-time audio applications

Python 1,910 154 Updated Feb 12, 2025

A Conversational Speech Generation Model

Python 14,436 1,465 Updated May 27, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 659 49 Updated Jun 5, 2025

Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"

Python 49 5 Updated Jul 29, 2025

Repository for training models for music source separation.

Python 1,114 153 Updated Jan 7, 2026

Simple and fast HTTP framework for Mojo! 🔥🐝

Mojo 710 46 Updated Dec 10, 2025

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 412 60 Updated Apr 21, 2022

Fast trigram based code search

1,741 115 Updated Jan 16, 2024

Fork of Google codesearch with more options

Go 51 12 Updated Mar 24, 2025

Lightning fast code searching made easy

JavaScript 5,805 595 Updated Dec 12, 2025

Haptic input knob with software-defined endstops and virtual detents

C++ 21,270 1,225 Updated Feb 19, 2024

Layout algorithms for visualizing directed acyclic graphs

TypeScript 1,505 90 Updated Sep 7, 2025

This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The text that includes words from two languages such as Hindi writt…

Python 56 10 Updated Aug 11, 2020

RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for rapid information retrieval and superior search accuracy.

Python 680 31 Updated Nov 2, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,670 2,109 Updated Oct 21, 2025

ReadingBank: A Benchmark Dataset for Reading Order Detection

115 4 Updated Aug 26, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,952 2,690 Updated Dec 15, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 38,902 4,681 Updated Aug 19, 2024

Temporal Python SDK

Python 929 147 Updated Jan 16, 2026

A C/C++ library for fast interval overlap queries (with a "bedtools coverage" example)

C 169 19 Updated May 28, 2024

Augmented Interval Tree implemented in Cython/C

C 20 1 Updated Jan 17, 2025
Next