F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization

This is a simplified implementation of F5R-TTS, based on the paper "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization" and intended for learning purposes.

Fig 1: The architecture of the backbone.

Fig 2: The pipeline of the GRPO phase.

Installation

# Create a Python 3.10 conda env (virtualenv works as well)
conda create -n f5r-tts python=3.10
conda activate f5r-tts
pip install -r requirements.txt

Inference

python ./src/f5_tts/infer/infer_cli.py \
  --model F5TTS_v1_Base \
  --ckpt_file "your_model_path" \
  --ref_audio "path_to_reference.wav" \
  --ref_text "reference_text" \
  --gen_text "generated_text" \
  --output_dir ./tests
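
To synthesize several sentences with the same reference audio, the command above can be driven from a short Python script. The following is a minimal sketch that only reuses the flags shown above; the helper names `build_infer_command` and `run_batch` are hypothetical and not part of the repository, and writing each sentence to its own subdirectory is an assumption made here to avoid overwriting earlier outputs.

```python
import subprocess
import sys
from pathlib import Path


def build_infer_command(ckpt_file, ref_audio, ref_text, gen_text,
                        output_dir="./tests", model="F5TTS_v1_Base"):
    """Build the argument list for infer_cli.py using only the flags from this README."""
    return [
        sys.executable, "./src/f5_tts/infer/infer_cli.py",
        "--model", model,
        "--ckpt_file", str(ckpt_file),
        "--ref_audio", str(ref_audio),
        "--ref_text", ref_text,
        "--gen_text", gen_text,
        "--output_dir", str(output_dir),
    ]


def run_batch(ckpt_file, ref_audio, ref_text, texts, output_root="./tests"):
    """Run one inference per sentence (hypothetical helper).

    Each sentence gets its own output subdirectory; this layout is an
    assumption of the sketch, not something the CLI requires.
    """
    for index, text in enumerate(texts):
        out_dir = Path(output_root) / f"utt_{index:03d}"
        cmd = build_infer_command(ckpt_file, ref_audio, ref_text, text, out_dir)
        # check=True raises CalledProcessError if the CLI exits with an error.
        subprocess.run(cmd, check=True)
```

Calling `run_batch("your_model_path", "path_to_reference.wav", "reference_text", ["first sentence", "second sentence"])` from the repository root would then launch the CLI once per sentence.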

Training

The GRPO phase requires a pretrained wespeaker model: download it and place it under the src/rl/wespeaker/multilingual directory before starting that phase.
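
Before launching the GRPO phase, it may help to confirm that the wespeaker model is actually in place. The following is a minimal sketch of such a check; the function name `wespeaker_model_ready` is hypothetical, and because the README does not list the expected file names, the check only verifies that the directory exists and contains at least one file.

```python
from pathlib import Path

# Directory named in the training instructions above.
WESPEAKER_DIR = Path("src/rl/wespeaker/multilingual")


def wespeaker_model_ready(model_dir=WESPEAKER_DIR):
    """Return True if model_dir exists and holds at least one file.

    This does not validate the model itself; the expected file names are
    not given in the README, so none are assumed here.
    """
    model_dir = Path(model_dir)
    return model_dir.is_dir() and any(p.is_file() for p in model_dir.rglob("*"))
```

Run from the repository root, `wespeaker_model_ready()` returning `False` suggests the model still needs to be downloaded or was placed in a different directory.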

accelerate config

# Data preparation
python src/f5_tts/train/datasets/prepare_libritts.py

# Pretraining phase
accelerate launch src/f5_tts/train/train.py

# GRPO phase
accelerate launch src/f5_tts/train/train_rl.py
