Skip to content
View gabrielmittag's full-sized avatar

Block or report gabrielmittag

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 99 7 Updated Aug 1, 2025

A video quality MOS prediction model for videoconferencing calls that takes temporal distortions into account

Python 47 11 Updated Mar 17, 2025

Repository for Reinforcement learning based bandwidth estimation challenge

Python 36 6 Updated Oct 9, 2024

The ITU-T Software Tool Library (G.191)

C 91 26 Updated Sep 2, 2024

Open source reference implementation of ITU-T P.1204.3

Python 66 14 Updated Sep 2, 2025

Video Quality Metrics

MATLAB 46 10 Updated Aug 12, 2024

Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters

Jupyter Notebook 3,498 584 Updated May 25, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,473 519 Updated Jun 13, 2025

A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.

Python 2,180 205 Updated Sep 26, 2025

AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.

Python 32 3 Updated Jun 13, 2022

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Jupyter Notebook 10,389 2,215 Updated Jan 3, 2024

Bias-Aware Loss for Training Image and Speech Quality Prediction Models from Multiple Dataset

Python 6 2 Updated Apr 22, 2021

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

2,846 736 Updated May 19, 2023

MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement (ICML 2019, with Travel awards)

MATLAB 146 36 Updated Apr 19, 2021

An open-source framework for modeling real-time conversations in spoken dialogue systems.

Python 27 5 Updated Aug 12, 2022

A PyTorch-based Speech Toolkit

Python 10,831 1,610 Updated Nov 24, 2025

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,369 149 Updated Jun 6, 2024

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,082 253 Updated Jun 6, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 1,020 171 Updated Jul 5, 2023

Perceptual Quality Estimator for speech and audio

C++ 837 140 Updated May 17, 2025

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…

HTML 4 3 Updated Sep 28, 2023

Subjective quality scores recovery from noisy measurements.

Python 132 33 Updated Aug 25, 2023

Latex code for making neural networks diagrams

TeX 24,135 3,026 Updated Aug 21, 2023

Deep Learning based Quality metric for gaming content

Python 9 1 Updated Aug 18, 2021

Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"

Python 376 65 Updated Jul 21, 2024

Reference Implementations of Waveform Evaluation Networks (WEnets)

Python 25 6 Updated Sep 18, 2023

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 891 144 Updated Dec 1, 2024

This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Ama…

HTML 226 60 Updated Oct 23, 2025
Next