How can I train a 48khz or a 44.1khz VITS model? #3987
Replies: 1 comment 3 replies
Did you succeed? I also want to train at 48 kHz.
I'm having trouble training at a higher sampling rate. I'm not very good at coding, so I thought it would just be a matter of changing some parameters and asking ChatGPT, but I ran into errors: n_mel is wrong, there is a mismatch, "tensor a must be the same as tensor b". I couldn't find any help, or maybe the way I'm going about it is wrong.
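For reference, shape mismatches like these often appear when only the sample rate is changed and the STFT sizes are left at their 22050 Hz values. Below is a minimal sketch of a consistency check, not the training code itself. The field names (`fft_size`, `win_length`, `hop_length`, `num_mels`), the 22050 Hz baseline values, and the `upsample_rates` list are assumptions based on commonly published VITS configs and may be named differently in your config. The one constraint it relies on is that the decoder's upsampling factors must multiply to the hop length.

```python
import math
from dataclasses import dataclass, replace
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class AudioParams:
    # Illustrative field names; map them onto whatever your config actually uses.
    sample_rate: int
    fft_size: int
    win_length: int
    hop_length: int
    num_mels: int
    mel_fmax: Optional[float] = None


def scale_params(base: AudioParams, target_sr: int) -> AudioParams:
    """Scale the STFT sizes by the power of two nearest to target_sr / base.sample_rate."""
    factor = 2 ** round(math.log2(target_sr / base.sample_rate))
    return replace(
        base,
        sample_rate=target_sr,
        fft_size=int(base.fft_size * factor),
        win_length=int(base.win_length * factor),
        hop_length=int(base.hop_length * factor),
    )


def check(params: AudioParams, upsample_rates: Sequence[int]) -> List[str]:
    """Return a list of human-readable problems; an empty list means no issue was found."""
    problems = []
    if params.win_length > params.fft_size:
        problems.append("win_length is larger than fft_size")
    if params.mel_fmax is not None and params.mel_fmax > params.sample_rate / 2:
        problems.append("mel_fmax is above the Nyquist frequency")
    if math.prod(upsample_rates) != params.hop_length:
        problems.append(
            f"upsample rates multiply to {math.prod(upsample_rates)}, "
            f"but hop_length is {params.hop_length}"
        )
    return problems


# Assumed 22050 Hz baseline: 1024-point FFT, hop of 256, 80 mel bands.
base = AudioParams(sample_rate=22050, fft_size=1024, win_length=1024,
                   hop_length=256, num_mels=80)
p48 = scale_params(base, 48000)

print(p48)
print(check(p48, [8, 8, 2, 2]))  # baseline upsampling no longer matches the hop
print(check(p48, [8, 8, 4, 2]))  # one possible set of rates whose product is 512
```

The sketch only flags inconsistencies. Which upsampling rates and kernel sizes actually train well at 44.1 kHz or 48 kHz is a separate question that it does not answer.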
I also have some questions about VITS training. First, I don't know how to continue training from a checkpoint. I asked ChatGPT whether the code had any hints, but I didn't manage to resume a run. It isn't very important, since my computer is pretty stable, but sometimes Windows shuts down to update...
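As a starting point for resuming, here is a minimal sketch that finds the most recent checkpoint in an output folder. It assumes checkpoints are saved with the step number in the filename, for example `checkpoint_10000.pth`. That pattern and the `latest_checkpoint` helper are hypothetical, so adjust the regular expression to whatever your run folder actually contains. How the resulting path is handed back to the trainer depends on the training script and is deliberately left out.

```python
import re
from pathlib import Path
from typing import Optional

# Hypothetical naming pattern: "checkpoint_<step>.pth".
STEP_PATTERN = re.compile(r"checkpoint_(\d+)\.pth$")


def latest_checkpoint(run_dir: str) -> Optional[Path]:
    """Return the checkpoint with the highest step number, or None if there is none."""
    best_step = -1
    best_path = None
    for path in Path(run_dir).glob("*.pth"):
        match = STEP_PATTERN.search(path.name)
        if match is None:
            continue  # skip files such as a "best model" copy without a step number
        step = int(match.group(1))
        if step > best_step:
            best_step, best_path = step, path
    return best_path
```

Comparing the step numbers as integers matters here: a plain alphabetical sort would place `checkpoint_9000.pth` after `checkpoint_10000.pth`.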
The output model is a single .pth file. The VITS models I've used before came as a G_xxxxx.pth and a D_xxxxx.pth, and neither was this big. I trained a 22050 Hz model as a test, but I failed to load it into my program. Maybe the config.json is to blame, as it is very different from those of the other VITS models I've tried...
I also left the phoneme language at its default, phoneme_language="en-us". The person I'm training for speaks US English, so I didn't change it.
The code I used to train at 22050 Hz is the default one. I hope someone can teach me how to train at 48 kHz or 44.1 kHz :)