Stars
Record Audio from the User's Microphone in Apps that are Deployed to the Web. (via Browser Media-API, REACT-based, Streamlit Custom Component)
全中文注释.(The loss function of retinanet based on pytorch).(You can use it on one-stage detection task or classifical task, to solve data imbalance influence).用于one-stage目标检测算法,提升检测效果.你也可以在分类任务中使用该损失函…
library to read/write .npy and .npz files in C/C++
A pure python module for reading and writing kaldi ark files
An Open Source Machine Learning Framework for Everyone
Run TensorFlow models in C++ without installation and without Bazel
"Recurrent Models of Visual Attention" in TensorFlow
Deep neural network based speech enhancement toolkit
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
A tool/script for batch speech data enhancement with speed/volume/RIRS/MUSAN
Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN corpus.
Robust Speech Activity Detection (SAD) in movie audio
The codebase for Data-driven general-purpose voice activity detection.
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
Python interface to the WebRTC Voice Activity Detector
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Representation of Paper: On training targets for noise-robust voice activity detection.
Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).