Timbre conversion problem #75

Open
IndowK opened this issue Apr 23, 2023 · 4 comments

IndowK commented Apr 23, 2023

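Hello, I trained on speech from twenty VCTK speakers, p225 through p246 (including p226 and p231). The loss stopped decreasing at around 100,000 iterations (at about 31). When I use the model, however, only pitch and rhythm are converted; the timbre does not change, and the speech generated for timbre conversion is of very poor quality. I continued training to 200,000 iterations, but the loss did not decrease and the results were the same as at 100,000 iterations: only rhythm and pitch are converted, and timbre conversion is poor. What might be causing this? Should I keep training to 600,000-plus iterations?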

auspicious3000 (Owner) commented

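You may need to tune the bottleneck. Even though the data all comes from VCTK, a different selection of training data may still require retuning the bottleneck.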

IndowK (Author) commented Apr 23, 2023

My coding background is fairly weak, so I would like to ask: does tuning the bottleneck parameters mean adjusting dim_neck, dim_neck_2, and dim_neck_3 in hparams.py?
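
Assuming the bottleneck widths are indeed the three fields named in the question, one way to organize the tuning is to enumerate a small grid of candidate values and train one model per setting. The sketch below is hypothetical: `BottleneckConfig` and `bottleneck_grid` are illustrative names that do not come from the project, and the numbers are placeholders rather than the project's defaults or recommended values.

```python
from dataclasses import dataclass
from itertools import product
from typing import List, Sequence


@dataclass(frozen=True)
class BottleneckConfig:
    # Field names follow the question above; everything else is assumed.
    dim_neck: int
    dim_neck_2: int
    dim_neck_3: int


def bottleneck_grid(
    neck: Sequence[int],
    neck_2: Sequence[int],
    neck_3: Sequence[int],
) -> List[BottleneckConfig]:
    """Return every combination of the candidate bottleneck widths."""
    return [BottleneckConfig(a, b, c) for a, b, c in product(neck, neck_2, neck_3)]


# Placeholder candidate widths, chosen only to illustrate the grid.
candidates = bottleneck_grid([4, 8], [1, 2], [16, 32])
for config in candidates:
    # Each config would be copied into hparams.py before a training run.
    print(config)
```

In bottleneck-based voice conversion, a bottleneck that is too wide tends to let source-speaker information leak through, so trying narrower candidates first is a reasonable starting point when timbre fails to convert.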

IndowK (Author) commented Apr 23, 2023

I have another guess: could it be that I did not add the speaker ID?
[image]

auspicious3000 (Owner) commented

Yes, the speaker ID does need to be added.
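
One common way to supply a speaker ID to a conversion model is a one-hot vector over the training speakers. The sketch below is a minimal illustration under that assumption. The helper name `build_speaker_onehots` is hypothetical, and this thread does not show how the project expects the ID to be stored alongside the features.

```python
from typing import Dict, List


def build_speaker_onehots(speakers: List[str]) -> Dict[str, List[float]]:
    """Map each speaker name to a one-hot vector over the sorted speaker list."""
    ordered = sorted(set(speakers))
    table: Dict[str, List[float]] = {}
    for index, name in enumerate(ordered):
        vec = [0.0] * len(ordered)
        vec[index] = 1.0  # position of this speaker in the sorted list
        table[name] = vec
    return table


# Three speaker names mentioned in the thread, used only for illustration;
# a real run would list all twenty training speakers.
onehots = build_speaker_onehots(["p231", "p225", "p226"])
source_id = onehots["p226"]
target_id = onehots["p231"]
print(source_id, target_id)
```

At conversion time the target speaker's vector would presumably be passed in place of the source speaker's. If every utterance carries the same ID, or none at all, the decoder has nothing to condition timbre on, which would be consistent with the symptom described above.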
