Help with f5-tts_infer-cli Configuration (pt/br model) #774
-
|
Hello! When I use the model without I'm using it like this: #!/bin/bash
f5-tts_infer-cli \
--REF_AUDIO "samples/refaudio/myvoice.mp3" \ # contains audio in Brazilian Portuguese
--REF_TEXT "samples/refaudio/myvoice.txt" \ # contains the transcription
--CKPT_FILE "modelos/firepixel/ptbr/model_last.pt" \
--GEN_TEXT "Isso é um teste de geração de áudio em português Brasileiro" \
-w resultado.wavCould you guide me on what I might be doing wrong? |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 1 reply
-
|
two possible reasons:
|
Beta Was this translation helpful? Give feedback.
-
|
Fala Marvin, ta conseguindo usar o F5 em portugues com sua voz? Queria uma ajuda tb, tem um tutorial do que voce fez? |
Beta Was this translation helpful? Give feedback.
-
|
Fala galera, deu certo? Tambem tou recebendo "gibberish" como output, nao consigo gerar com esse "model_last.pt". Por favor avisa se tiver conseguido. |
Beta Was this translation helpful? Give feedback.
-
|
@SWivid Do we need a specific vocab.txt file to run this model in Portuguese? I am able to run successfully, but the audio comes out as gibberish using the default "Emilia_ZH_EN_pinyin" vocab.txt file. I am guessing that this is the issue, but not sure. |
Beta Was this translation helpful? Give feedback.
two possible reasons:
ref_textneed to be text but not file pathhttps://github.com/SWivid/F5-TTS/tree/main/src/f5_tts/infer
if your finetuned model is trained with shorter audio samples, need to make sure total length (ref + gen audio length) shorter than the max length seen during finetuning.
modify in e.g.
https://github.com/SWivid/F5-TTS/blob/main/src/f5_tts/infer/utils_infer.py
F5-TTS/src/f5_tts/infer/utils_infer.py
Line 291 in f062403
F5-TTS/src/f5_tts/infer/utils_infer.py
Lines 385 to 386 in f062403