Skip to content
Change the repository type filter

All

    Repositories list

    • InspireMusic: A Unified Framework for Music, Song, Audio Generation.
      Python
      121000Updated May 9, 2025May 9, 2025
    • PDMX

      Public
      PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing
      Python
      4000Updated Oct 2, 2024Oct 2, 2024
    • lycon

      Public
      Python
      1000Updated Sep 1, 2024Sep 1, 2024
    • diarizers

      Public
      Python
      23000Updated Jun 14, 2024Jun 14, 2024
    • Awesome speech/audio LLMs, representation learning, and codec models
      71000Updated Apr 13, 2024Apr 13, 2024
    • Zero-Shot Speech Editing and Text-to-Speech in the Wild
      Jupyter Notebook
      797000Updated Mar 29, 2024Mar 29, 2024
    • Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
      Python
      2.5k000Updated Jun 25, 2023Jun 25, 2023
    • Port of OpenAI's Whisper model in C/C++
      C
      4.8k000Updated Feb 18, 2023Feb 18, 2023
    • Audio generation using diffusion models, in PyTorch.
      Python
      177000Updated Aug 17, 2022Aug 17, 2022
    • LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a very small number of parameters.
      Python
      53000Updated Mar 1, 2022Mar 1, 2022
    • NATSpeech

      Public
      A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)
      Python
      103000Updated Feb 17, 2022Feb 17, 2022
    • AutoEq

      Public
      Automatic headphone equalization from frequency responses
      Jupyter Notebook
      2.5k000Updated Dec 2, 2021Dec 2, 2021
    • soundata

      Public
      Python library for downloading, loading & working with sound datasets
      Python
      27000Updated Nov 24, 2021Nov 24, 2021
    • TTS

      Public
      🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
      Jupyter Notebook
      5.7k000Updated Sep 17, 2021Sep 17, 2021
    • praudio

      Public
      Audio preprocessing framework for Deep Learning audio applications
      Python
      10000Updated Aug 27, 2021Aug 27, 2021
    • Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
      Python
      115000Updated Jun 9, 2021Jun 9, 2021
    • Command-line tools for speech and intent recognition on Linux
      Python
      67000Updated May 21, 2021May 21, 2021
    • The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
      HTML
      31000Updated Mar 18, 2021Mar 18, 2021
    • A C++ standalone library for machine learning
      C++
      501000Updated Mar 4, 2021Mar 4, 2021
    • Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
      Python
      96000Updated Feb 24, 2021Feb 24, 2021
    • Spokestack is a library that allows a user to easily incorporate a voice interface into a Python application.
      Python
      14000Updated Jan 27, 2021Jan 27, 2021
    • pyttsx3

      Public
      Offline Text To Speech synthesis for python
      Python
      353000Updated Sep 30, 2020Sep 30, 2020
    • espnet

      Public
      End-to-End Speech Processing Toolkit
      Python
      2.3k000Updated Jun 5, 2020Jun 5, 2020
    • An implementation of a Convolutional Neural Network to Classify Music Genres
      Python
      9000Updated Sep 5, 2019Sep 5, 2019
    • implementation of music transformer with tensorflow-2.0 (ICLR2019)
      Python
      78000Updated Aug 12, 2019Aug 12, 2019
    • lmms

      Public
      Cross-platform music production software
      C++
      1.1k000Updated Apr 17, 2019Apr 17, 2019
    • snickery

      Public
      Hybrid speech synthesiser
      Python
      6000Updated Nov 6, 2018Nov 6, 2018
    • Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
      Python
      80000Updated Oct 5, 2018Oct 5, 2018
    • pytheory

      Public
      Music Theory for Humans.
      Python
      80000Updated Sep 10, 2018Sep 10, 2018
    • amodem

      Public
      Audio MODEM Communication Library in Python
      Python
      129000Updated Jun 18, 2018Jun 18, 2018