Skip to content
Discussion options

You must be logged in to vote

In about 10 minutes it produced a .srt file whose first lines are...
There is no trace of the sentences I highlighted in bold in the audio file, at least not around those timestamps.
Amara.org is a subtitle service.

This is just a hallucination.
To reduce hallucinations try --vad_method pyannote_v3 and --model large-v2

Should I assume that the Whisper medium model was also trained with a subtitle from Amara.org for this specific movie?

No.

Replies: 2 comments 3 replies

Comment options

You must be logged in to vote
0 replies
Answer selected by Purfview
Comment options

You must be logged in to vote
3 replies
@Purfview
Comment options

@colemar
Comment options

@Purfview
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants