A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,517 237 Updated Sep 25, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 27,383 2,510 Updated Oct 12, 2025

Code for NeurIPS 2024 paper - The GAN is dead; long live the GAN! A Modern Baseline GAN - by Huang et al.

Python 823 44 Updated Jan 23, 2025
Jupyter Notebook 18 1 Updated Nov 8, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 663 96 Updated Aug 22, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,992 802 Updated Oct 9, 2025

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,685 470 Updated Oct 11, 2025

Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.

Python 1,090 96 Updated Jan 15, 2025

A novel human-interaction method for real-time speech extraction on headphones.

Python 585 66 Updated Jun 5, 2024

Fast and differentiable time domain all-pole filter in PyTorch.

Python 65 5 Updated Sep 16, 2025

Kolmogorov Arnold Networks

Jupyter Notebook 15,920 1,518 Updated Jan 19, 2025

Generative models for conditional audio generation

Python 3,460 386 Updated Oct 9, 2025

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 997 116 Updated Aug 7, 2024

Modern audio compression for the internet.

C 2,781 704 Updated Oct 8, 2025

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,586 159 Updated Oct 3, 2025

Official repo for consistency models.

Python 6,420 434 Updated Mar 22, 2024

Official implementation of SawSing (ISMIR'22)

Python 268 40 Updated Aug 28, 2022

List of Computer Science courses with video lectures.

70,019 9,406 Updated Oct 2, 2025

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

Python 190 30 Updated Dec 8, 2022

A PyTorch-based Speech Toolkit

Python 10,542 1,563 Updated Oct 9, 2025

List of speech synthesis papers.

1,061 122 Updated Jul 24, 2023
Jupyter Notebook 22 3 Updated Feb 25, 2020

Official implementation of "Implicit Neural Representations with Periodic Activation Functions"

Python 1,889 265 Updated Jul 27, 2024

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

Jupyter Notebook 1,621 347 Updated Apr 22, 2024

End-to-End Speech Processing Toolkit

Python 9,506 2,332 Updated Oct 8, 2025

Authors' implementation of DeepSpeech Distances.

Jupyter Notebook 130 12 Updated May 5, 2020

Master programming by recreating your favorite technologies from scratch.

Markdown 426,060 39,974 Updated Oct 10, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 40,370 4,576 Updated Oct 13, 2025

PyTorch implementations of Generative Adversarial Networks.

Python 17,288 4,094 Updated Jun 18, 2024

Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch

C 489 58 Updated Oct 28, 2023