Stars
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…
Training, validation, and inference code for various SSL approaches and architectures.
A reference implementation of the Resonate algorithm in C++ for Python.
Experimenting with Neural Amp Modeler models implemented with RTNeural
The sampler that dreams. AI-powered VST3 for real-time music generation. Generate tempo-synced loops, trigger via MIDI, sculpt the unexpected. 8-track sampler meets infinite sound engine. No pre-ma…
The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]
AFTER : Audio Features Transfer and Exploration in Real-time
Python module for creating macOS packages easily
Machine Learning inference engine for Microcontrollers and Embedded devices
Lets make video diffusion practical!
Official PyTorch implementation of StyleGAN3
A modern and transparent way to use Windows VST2, VST3 and CLAP plugins on Linux
Code repository of the paper "Variational Stochastic Gradient Descent for Deep Neural Networks" published at
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
Free in the Dark, a Alone in the Dark engine reimplementation.
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
Audio super resolution using neural networks
Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)
An extremely fast Python linter and code formatter, written in Rust.