DelightfulTTS with Hifi-GAN and Univnet vocoders
Updated Jun 4, 2024 - Jupyter Notebook
TTS (FastPitch) for German
In this repo, I developed a step-by-step pipeline for a standard multi-speaker text-to-speech system. In general, I used PortaSpeech as the acoustic model and iSTFTNet as the vocoder...
Catalan Text to Speech
Aligning latent space of speaking style with human perception using a re-embedding strategy
Python package for NSF and NSF-HiFi-GAN (unofficial)
POSCO Youth AI·Big Data Academy - AI Project
This is the experimental description of MnTTS2.
Audio samples from "HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis"
Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis.
PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
TTS models for Arabic (Tacotron2, FastPitch)
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav) system, supporting a family of SOTA unsupervised duration modeling methods. This project grows with the research community, aiming to achieve the ultimate E2E-TTS.
PyTorch Implementation of PortaSpeech: Portable and High-Quality Generative Text-to-Speech