How to correct the pronunciation of polyphones and output subtitle files with timestamps #959
Unanswered
single1225
asked this question in
Q&A
Replies: 0 comments
I deployed F5-TTS locally and have run into two problems with speech synthesis.
First, I want to correct the pronunciation of Chinese polyphones (characters with more than one reading). Is this possible? If so, how can I do it in Python? I couldn't find any relevant documentation.
Second, is there a way to output a subtitle file with timestamps along with the audio?
Thank you very much for your help.
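To make the second request concrete, here is a rough sketch of the kind of output I have in mind. It assumes the text is synthesized one sentence at a time and that the duration of each generated chunk is known (for example, number of samples divided by the sample rate). The `Segment` class and the `build_srt` and `format_timestamp` helpers are hypothetical names of my own, not part of F5-TTS, and the code only uses the Python standard library.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One synthesized sentence (hypothetical helper, not an F5-TTS type)."""
    text: str
    duration: float  # seconds of audio generated for this text


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def build_srt(segments: list[Segment]) -> str:
    """Build SRT text by accumulating segment durations back to back."""
    blocks = []
    start = 0.0
    for index, seg in enumerate(segments, start=1):
        end = start + seg.duration
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{seg.text}\n"
        )
        start = end
    # A blank line separates consecutive subtitle entries.
    return "\n".join(blocks)
```

Writing the returned string to a `.srt` file with UTF-8 encoding would give a subtitle file aligned at sentence level. This assumes the audio chunks are concatenated without extra silence or cross-fading; any padding between chunks would need to be added to the running start time.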