How to correct the pronunciation of polyphones and output subtitle files with timestamps #959

single1225 · 2025-04-08T07:21:10Z

single1225
Apr 8, 2025

I deployed f5-tts locally. When performing speech synthesis, two problems bothered me.
First, I want to correct the pronunciation of Chinese polyphones. Is this function possible? If so, how to implement it in Python code? I haven't seen any relevant documents.
Second, is there any way to output subtitle files with timestamps while outputting audio?

Thank you very much for your help

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to correct the pronunciation of polyphones and output subtitle files with timestamps #959

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

How to correct the pronunciation of polyphones and output subtitle files with timestamps #959

Uh oh!

single1225 Apr 8, 2025

Replies: 0 comments

single1225
Apr 8, 2025