Dongrui Dongru1

🎯

Focusing

0 followers · 5 following

Highlights

audio_feature_extractor Public

Python MIT License Updated Oct 24, 2025
Amphion Public
Forked from open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python MIT License Updated Dec 27, 2024
seed-vc Public
Forked from Plachtaa/seed-vc

zero-shot voice conversion & singing voice conversion with in context learning

Python 1 GNU General Public License v3.0 Updated Oct 30, 2024
GigaSpeech2 Public
Forked from SpeechColab/GigaSpeech2

An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement

Python Apache License 2.0 Updated Sep 17, 2024
Retrieval-based-Voice-Conversion-WebUI Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python MIT License Updated Sep 5, 2024
XPhoneBERT Public
Forked from VinAIResearch/XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python MIT License Updated Jul 22, 2024
Retrieval-based-Voice-Conversion Public
Forked from RVC-Project/Retrieval-based-Voice-Conversion

in preparation...

Python MIT License Updated Jul 12, 2024
AutoAWQ Public
Forked from casper-hansen/AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python MIT License Updated Apr 26, 2024
coqui-ai-TTS Public
Forked from coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python Mozilla Public License 2.0 Updated Apr 8, 2024
dongru1.github.io Public

JavaScript MIT License Updated Mar 19, 2024
vits_chinese Public
Forked from PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python MIT License Updated Feb 5, 2024
llama-recipes Public
Forked from meta-llama/llama-cookbook

Examples and recipes for Llama 2 model

Jupyter Notebook Other Updated Feb 5, 2024
espnet Public
Forked from espnet/espnet

End-to-End Speech Processing Toolkit

Python Apache License 2.0 Updated Jan 30, 2024
vits Public
Forked from jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python MIT License Updated Jan 24, 2024
llama Public
Forked from meta-llama/llama

Inference code for LLaMA models

Python Other Updated Jan 21, 2024
PaddleSpeech Public
Forked from PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python Apache License 2.0 Updated Jan 16, 2024
FastSpeech2 Public
Forked from ming024/FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python MIT License Updated Oct 27, 2023
vall-e Public
Forked from enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python MIT License Updated May 10, 2023

Dongrui Dongru1

Highlights

audio_feature_extractor Public

Uh oh!

Amphion Public

Uh oh!

seed-vc Public

Uh oh!

GigaSpeech2 Public

Uh oh!

Retrieval-based-Voice-Conversion-WebUI Public

Uh oh!

XPhoneBERT Public

Uh oh!

Retrieval-based-Voice-Conversion Public

Uh oh!

AutoAWQ Public

Uh oh!

coqui-ai-TTS Public

Uh oh!

dongru1.github.io Public

Uh oh!

vits_chinese Public

Uh oh!

llama-recipes Public

Uh oh!

espnet Public

Uh oh!

vits Public

Uh oh!

llama Public

Uh oh!

PaddleSpeech Public

Uh oh!

FastSpeech2 Public

Uh oh!

vall-e Public

Uh oh!