Is YourTTS going in the direction you are looking for?
Amazing things can be done with the TTS stuff, but what we really need is the ability to train a voice from one source and then overlay it on a performance.
Altered.ai is now offering that, but by starting at $160 USD a month, with a 3-month minimum commitment, and going UP from there, they have ensured that ONLY the bad actors have access to this technology.
I don't know about you, but I don't have $500 to commit to a project, with yet another gatekeeper telling me that THEIR ethics might allow me to use their tech for my villain, or even my hero!!!
We need the FOSS community to break this open and democratize this tech so it's THEIR Genie in THEIR bottle.
It's really only within the framework of THIS kind of performance technology that the TTS abilities make sense. No matter whose voice is currently being used to speak text, the text is still missing too much tone, cadence, and authenticity to work as a performance.
A character "Font" needs to be able to recreate an audio performance, and then tweak it through TTS, so the TTS portion is limited to fixing a few lines. I would also like to see these voice "Fonts" be downloadable by anyone so we can start a library of virtual actors.
Ideally, those who don't have the equipment to create the Voice Font would still be able to use them once the training data set is done, without web access or a network. A Font needs to be a completely local resource.
Is there any chance this project could move in this direction? Or is anyone doing this research working on THIS angle?
https://www.altered.ai/
https://youtu.be/AALf9w37COM