-
Hi all, I tested the Whisper audio-to-text tool for the first time on a 2-hour audio file extracted from the Italian dub of the movie "On the Beach" (1959). In about 10 minutes it produced an .srt file whose first lines are: (English: "subtitles and revision at Kanataka Ready to surface") (English: "subtitles created by the Amara.org community") There is no trace of the sentences I highlighted in bold in the audio file, at least not around those timestamps. Should I assume that the Whisper medium model was also trained on a subtitle file from Amara.org for this specific movie?
Replies: 2 comments 3 replies
-
This is just a hallucination.
To reduce hallucinations, try --vad_method pyannote_v3 and --model large-v2.
No.
-
Yes, much better with this command line (run from Subtitle Edit):