Amphion Public
Forked from open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Python MIT License Updated Dec 27, 2024
seed-vc Public
Forked from Plachtaa/seed-vc
zero-shot voice conversion & singing voice conversion with in context learning
GigaSpeech2 Public
Forked from SpeechColab/GigaSpeech2
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
Python Apache License 2.0 Updated Sep 17, 2024
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
Python MIT License Updated Sep 5, 2024
XPhoneBERT Public
Forked from VinAIResearch/XPhoneBERT
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)
Python MIT License Updated Jul 22, 2024
Retrieval-based-Voice-Conversion Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion
in preparation...
Python MIT License Updated Jul 12, 2024
AutoAWQ Public
Forked from casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
Python MIT License Updated Apr 26, 2024
coqui-ai-TTS Public
Forked from coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Python Mozilla Public License 2.0 Updated Apr 8, 2024
vits_chinese Public
Forked from PlayVoice/vits_chinese
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
Python MIT License Updated Feb 5, 2024
llama-recipes Public
Forked from meta-llama/llama-cookbook
Examples and recipes for Llama 2 model
Jupyter Notebook Other Updated Feb 5, 2024
espnet Public
Forked from espnet/espnet
End-to-End Speech Processing Toolkit
Python Apache License 2.0 Updated Jan 30, 2024
vits Public
Forked from jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python MIT License Updated Jan 24, 2024
llama Public
Forked from meta-llama/llama
Inference code for LLaMA models
Python Other Updated Jan 21, 2024
PaddleSpeech Public
Forked from PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Python Apache License 2.0 Updated Jan 16, 2024
FastSpeech2 Public
Forked from ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Python MIT License Updated Oct 27, 2023
vall-e Public
Forked from enhuiz/vall-e
An unofficial PyTorch implementation of the audio LM VALL-E
Python MIT License Updated May 10, 2023