spin-for-rvc

This project replaces contentvec with spin for disentangling speaker information.

Download a spin hubert checkpoint here. (Official checkpoints have timbre bleed in RVC; it is recommended to download my trained model below.)

Run convert_lighting.py to convert the spin PyTorch Lightning checkpoint to a standard PyTorch .pth file.
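For readers curious what this kind of conversion involves, here is a minimal sketch. It is not the contents of convert_lighting.py. It assumes the usual Lightning layout, where the weights sit under a `state_dict` key and each parameter name carries the attribute prefix of the wrapped model. The prefix `model.` used in the usage note and the function names are hypothetical.

```python
from collections import OrderedDict
from typing import Any, Mapping


def extract_state_dict(checkpoint: Mapping[str, Any], prefix: str = "") -> "OrderedDict[str, Any]":
    """Return the model weights from a Lightning-style checkpoint dict.

    `prefix` is stripped from the start of any key that has it; keys
    without the prefix are kept unchanged.
    """
    # Lightning checkpoints keep weights under "state_dict"; if that key
    # is missing, assume the mapping already is a plain state dict.
    state = checkpoint.get("state_dict", checkpoint)
    cleaned = OrderedDict()
    for key, value in state.items():
        if prefix and key.startswith(prefix):
            key = key[len(prefix):]
        cleaned[key] = value
    return cleaned


def convert_file(src: str, dst: str, prefix: str = "") -> None:
    """Load a Lightning checkpoint from `src` and save plain weights to `dst`."""
    import torch  # imported lazily so extract_state_dict works without PyTorch

    # Lightning checkpoints hold non-tensor objects, so weights_only must be off.
    # Only load checkpoints from sources you trust.
    checkpoint = torch.load(src, map_location="cpu", weights_only=False)
    torch.save(extract_state_dict(checkpoint, prefix), dst)
```

A call such as `convert_file("spin.ckpt", "spin.pth", prefix="model.")` would then write a plain state dict. Both file names and the prefix are placeholders; inspect the checkpoint keys to find the real prefix.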

Run convert_spin_to_transformers.py to convert the new spin.pth checkpoint into a drop-in replacement for RVC.

Pretrains need to be fine-tuned again after extracting features with spin.

UPDATE:

The official checkpoints, which have only layers 11 and 12 trained, have timbre bleed. The current fix is to train transformer layers 7-12 instead. Below is the currently accepted checkpoint for RVC, which is also used on AIhub models. This model was trained on the LibriSpeech 400+ hour dataset.

~~https://huggingface.co/dr87/spin-for-rvc/resolve/main/spin_layers_7_12.zip~~
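The layer selection described above (training transformer layers 7-12 rather than only 11 and 12) can be sketched as a freeze rule on parameter names. This is an illustration, not the training code used for the checkpoint. It assumes parameter names contain `encoder.layers.<index>.` with a zero-based index, as in common HuBERT implementations, and that the layer numbers in this README are one-based. Any extra trainable heads that spin training uses are not covered here.

```python
import re

# Matches the zero-based transformer layer index inside a parameter name.
_LAYER_RE = re.compile(r"encoder\.layers\.(\d+)\.")


def is_trainable(param_name: str, first_layer: int = 7, last_layer: int = 12) -> bool:
    """Return True if the parameter belongs to transformer layers
    first_layer..last_layer (one-based, inclusive)."""
    match = _LAYER_RE.search(param_name)
    if match is None:
        # Not part of a transformer layer (e.g. the convolutional feature extractor).
        return False
    layer_number = int(match.group(1)) + 1  # convert zero-based index to one-based
    return first_layer <= layer_number <= last_layer


def apply_freeze(model, first_layer: int = 7, last_layer: int = 12) -> None:
    """Set requires_grad on a torch.nn.Module so only the chosen layers train."""
    for name, param in model.named_parameters():
        param.requires_grad = is_trainable(name, first_layer, last_layer)
```

Passing `first_layer=11` reproduces the narrower selection that the official checkpoints are described as using.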

Spinv2 is here. Timbre bleed is fixed, as is the slightly slurred speech seen with shorter datasets. Pronunciation should be clearer, and the codebook has been reduced to 1024 for a number of reasons; in short, it just performs better in RVC.

https://huggingface.co/dr87/spinv2_rvc/blob/main/spinv2.zip
