
TransferTTS (Zero-shot VITS) - PyTorch Implementation (-Ongoing-)

Note (09.23.)

Currently, this is only an implementation of the zero-shot system, not of the paper's first contribution: the transfer-learning framework using wav2vec 2.0. As future work, a model with complete implementations of both contributions (zero-shot and transfer learning) will be provided in a follow-up repository. Congratulations to the authors on receiving a best paper award at INTERSPEECH 2022.

Overview

Unofficial PyTorch implementation of Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus. Most of the code is based on VITS.

  1. MelStyleEncoder from StyleSpeech is used instead of the reference encoder (see the sketch after this list).
  2. Implementation of untranscribed data training is omitted.
  3. The LibriTTS dataset (train-clean-100 and train-clean-360) is used. The sampling rate is set to 22050 Hz.
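
For intuition, below is a minimal sketch of the style-conditioning idea: a MelStyleEncoder-like module pools a reference mel-spectrogram into one utterance-level style vector, which replaces the fixed speaker-ID embedding of multi-speaker VITS so that unseen speakers can be synthesized from a single reference utterance. The layer sizes and structure here are illustrative assumptions, not this repository's exact modules.

import torch
import torch.nn as nn

class MelStyleEncoder(nn.Module):
    # Sketch of a StyleSpeech-style mel encoder: self-attention over
    # time followed by temporal average pooling into one style vector.
    def __init__(self, n_mel=80, d_hidden=128, d_style=256):
        super().__init__()
        self.pre = nn.Sequential(
            nn.Linear(n_mel, d_hidden), nn.ReLU(),
            nn.Linear(d_hidden, d_hidden), nn.ReLU(),
        )
        self.attn = nn.MultiheadAttention(d_hidden, num_heads=2, batch_first=True)
        self.out = nn.Linear(d_hidden, d_style)

    def forward(self, mel):            # mel: [B, T, n_mel]
        x = self.pre(mel)              # [B, T, d_hidden]
        x, _ = self.attn(x, x, x)      # attend across frames
        x = x.mean(dim=1)              # temporal average pooling
        return self.out(x)             # style vector [B, d_style]

style = MelStyleEncoder()(torch.randn(2, 400, 80))   # -> [2, 256]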

Pre-requisites (from VITS)

  1. Python >= 3.6
  2. Clone this repository
  3. Install Python requirements. Please refer to requirements.txt.
    1. You may need to install espeak first: apt-get install espeak
  4. Build Monotonic Alignment Search and run preprocessing if you use your own datasets.
# Cython-version Monotonic Alignment Search
cd monotonic_align
python setup.py build_ext --inplace
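
For reference, the dynamic program that this extension accelerates can be written in plain Python. This is an explanatory sketch of Monotonic Alignment Search as used in VITS, not the extension's actual API:

import numpy as np

def monotonic_alignment_search(log_p):
    # log_p[i, j]: log-likelihood of mel frame j under text token i,
    # assuming T_text <= T_mel. Returns a hard 0/1 alignment that is
    # monotonic, starts at (0, 0), and ends at (T_text-1, T_mel-1).
    T_text, T_mel = log_p.shape
    Q = np.full((T_text, T_mel), -np.inf)   # best cumulative score
    Q[0, 0] = log_p[0, 0]
    for j in range(1, T_mel):
        for i in range(min(j + 1, T_text)):
            stay = Q[i, j - 1]                            # same token
            move = Q[i - 1, j - 1] if i > 0 else -np.inf  # next token
            Q[i, j] = log_p[i, j] + max(stay, move)
    path = np.zeros((T_text, T_mel), dtype=np.int64)
    i = T_text - 1
    for j in range(T_mel - 1, -1, -1):      # backtrack the argmax path
        path[i, j] = 1
        if i > 0 and j > 0 and Q[i - 1, j - 1] >= Q[i, j - 1]:
            i -= 1
    return path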

Preprocessing

Run

python prepare_wav.py --data_path [LibriTTS DATAPATH]

to prepare the wav files.
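
I have not verified prepare_wav.py's internals, but since LibriTTS ships at 24 kHz and the configs here use 22050 Hz, the preparation presumably boils down to resampling, roughly as follows (the output naming is illustrative):

import glob
import os
import librosa
import soundfile as sf

DATA_PATH = "/path/to/LibriTTS"   # the --data_path argument
TARGET_SR = 22050                 # sampling rate used by the configs

for wav_path in glob.glob(os.path.join(DATA_PATH, "**", "*.wav"), recursive=True):
    audio, _ = librosa.load(wav_path, sr=TARGET_SR)   # 24 kHz -> 22.05 kHz
    sf.write(wav_path.replace(".wav", "_22k.wav"), audio, TARGET_SR)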

Training

Train your model with

python train_ms.py -c configs/libritts.json -m libritts_base
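
VITS-style training scripts normally write checkpoints and TensorBoard logs to ./logs/<model_name>; assuming this repository keeps that convention, training can be monitored with

tensorboard --logdir ./logs/libritts_base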

Inference

python inference.py --ref_audio [REF AUDIO PATH] --text [INPUT TEXT]
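
The reference audio supplies the zero-shot speaker identity: it is converted to a mel-spectrogram and pooled into a style vector (see the MelStyleEncoder sketch in the Overview), which conditions synthesis in place of a speaker ID. A minimal front-end sketch; the STFT parameters below are common VITS defaults and are assumptions about this repository's config:

import torch
import librosa

ref, sr = librosa.load("ref.wav", sr=22050)   # reference utterance
mel = librosa.feature.melspectrogram(
    y=ref, sr=sr, n_fft=1024, hop_length=256, win_length=1024, n_mels=80)
log_mel = torch.log(torch.clamp(torch.from_numpy(mel), min=1e-5))
# log_mel: [80, T]; the style encoder pools over T into one vector.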

References

  1. Kim et al., "Transfer Learning Framework for Low-Resource Text-to-Speech using a Large-Scale Unlabeled Speech Corpus", INTERSPEECH 2022
  2. Kim et al., "Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech" (VITS), ICML 2021
  3. Min et al., "Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation" (StyleSpeech), ICML 2021
