Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Voice to semantic #19

Open
huydung179 opened this issue Jun 27, 2023 · 3 comments
Open

Voice to semantic #19

huydung179 opened this issue Jun 27, 2023 · 3 comments

Comments

@huydung179
Copy link

If I well understood, you used a custom semantic-voice dataset for training your HuBERT model. Can you tell me how to create this dataset? Especially how to get the semantic from a voice? Many thanks for this work.

@gitmylo
Copy link
Owner

gitmylo commented Jun 27, 2023

The dataset creation code is up at https://github.com/gitmylo/bark-data-gen

To get the semantics from a voice, you have to use a trained HuBERT quantizer model. See a problem? It cannot be improved for a specific voice, because all you could train on, is previous outputs.

To understand why it works, you need to understand how bark works. https://github.com/gitmylo/audio-webui/wiki/how-bark-works
The quantizer model just converts recognized speech patterns into a format which bark understands, and is able to complete. Essentially cloning a voice.

@iamhch24
Copy link

iamhch24 commented Sep 1, 2023

Dear gitmylo, I also want to know how to create semantic data from wav source files.
I gather Korean wav files and I need to make semantic data from them, also need to pre-train both semantic data and wav files.
Could you explain about details.
I really appreciate your great job.

@gitmylo
Copy link
Owner

gitmylo commented Sep 1, 2023

If you want to train, you'll need a text dataset in the language you want to train for, you can modify the bark-data-gen code to load text files in another language for example. Then prepare the dataset, and train, as explained in https://github.com/gitmylo/bark-voice-cloning-HuBERT-quantizer#how-do-i-train-it-myself. And just follow the other steps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants