Stars
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
A Python library for Real-time Music Alignment
TheGlueNote is representation model for note-wise music alignment.
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
Accompanying repository for our ISMIR 2025 article "Exploring System Adaptations for Minimum Latency Real-Time Piano Transcription"
Clarity Challenge toolkit - software for building Clarity Challenge systems
Repository for training models for music source separation.
A reference implementation of the Resonate algorithm in C++ for Python.
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
ACE-Step: A Step Towards Music Generation Foundation Model
C++ classes for reading/writing Standard MIDI Files
A modern C++ MIDI 1 / MIDI 2 real-time & file I/O library. Supports Windows, macOS, Linux and WebMIDI.
Framework for differentiable black-box and gray-box audio effects modeling
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Pytorch implementation of automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture (hFT-Transformer).
A fast single-producer, single-consumer lock-free queue for C++
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
Unaligned Supervision for Automatic Music Transcription in The Wild
A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'.
multi-task and multi-track music transcription for everyone
Unofficial implementation of SpecTNT in pytorch
Z.Wang & G.Xia, MuseBERT: Pre-training of Music Representation for Music Understanding and Controllable Generation, ISMIR 2021
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Python audio and music signal processing library
C++ polyphonic pitch/time library (GitHub mirror)