- Munich, Germany
- https://scholar.google.de/citations?user=itIWDO8AAAAJ
Stars
Inference and deployment toolkit for Svara-TTS, an open-source multilingual text-to-speech model for Indic languages — includes examples for local GGUF inference, Gradio demo, and deployment guides.
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
A list of tools, papers and code related to Fake Audio Detection.
A comprehensive benchmark of deepfake detection
A list of tools, papers and code related to Deepfake Detection.
A list of publicly available room impulse response datasets and scripts to download them.
Baselines for IS25 Source Tracing Special Session
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
audioLIME: Listenable Explanations Using Source Separation
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license struggling.
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
A multi-voice TTS system trained with an emphasis on quality
This repository includes the code to reproduce our paper "End-to-end anti-spoofing with RawNet2" (https://arxiv.org/abs/2011.01108) published in ICASSP '21.
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
A high-level toolbox for using complex valued neural networks in PyTorch
A tool to create seperate exercise/solution files from a single .ipynb input notebook.
StyleGAN2-ADA - Official PyTorch implementation
A PyTorch model for Stanford Cars Datasets: https://ai.stanford.edu/~jkrause/cars/car_dataset.html
Learning computer vision by striving to maximise accuracy on the Stanford Cars dataset
Pytorch speech emotion recognition for RAVDESS dataset with CNN.