Skip to content
Discussion options

You must be logged in to vote

core codes with model here https://github.com/SWivid/F5-TTS/tree/main/src/f5_tts/model

basically change the text format is fine

text: int["b nt"] | list[str], # noqa: F722

# handle text as string
if isinstance(text, list):
if exists(self.vocab_char_map):
text = list_str_to_idx(text, self.vocab_char_map).to(device)
else:
text = list_str_to_tensor(text).to(device)
assert text.shape[0] == batch

text is currently input like: [['h', 'o', 'w', ' ', 'a', 'r', 'e', ' ', 'y', 'o', 'u', '?',], ['i', ''', 'm', ' ', 'f', 'i', '…

Replies: 4 comments 13 replies

Comment options

You must be logged in to vote
1 reply
@Alykasym
Comment options

Answer selected by Alykasym
Comment options

You must be logged in to vote
1 reply
@Alykasym
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
11 replies
@Alykasym
Comment options

@dpp-user
Comment options

@Alykasym
Comment options

@campar
Comment options

@Alykasym
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
6 participants