- Chicago, IL
- https://ming024.github.io/
Stars
Layer-wise analysis of self-supervised pre-trained speech representations
Official implementation of FCL-taco2: Fast, Controllable and Lightweight version of Tacotron2 @ ICASSP 2021
Self-Supervised Speech Pre-training and Representation Learning Toolkit
An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".
A Bootstrap 4 resume/CV theme created by Start Bootstrap
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
A PyTorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model
A PyTorch implementation of the universal neural vocoder
Speaker embedding (d-vector) trained with GE2E loss
PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German; easy to adapt to other languages)
VAE Tacotron 2, an alternative to GST Tacotron
PyTorch implementation of Tacotron speech synthesis model.
A TensorFlow implementation of "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
DeepMind's Tacotron-2 TensorFlow implementation
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
An implementation of FastSpeech based on PyTorch.