@mohamedalibarkailluin

Issue:

Sometimes the generated transcription gets stuck repeating the same words, and the following warning is logged:

Compression ratio threshold is not met with temperature 0.0

The cause is that the default value for temperature in speaches is 0. Passing a higher temperature avoids the problem, but that value then applies to the whole transcript.

In faster-whisper, when temperature is not specified, decoding starts at 0. Only when a segment fails the compression ratio threshold or the log probability threshold is that segment retried at 0.2, and so on through the list [0, 0.2, 0.4, 0.6, 0.8, 1.0].
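
To illustrate the per-segment retry described above, here is a minimal sketch in plain Python. It is not faster-whisper's actual code: `decode_with_fallback` and `fake_decode` are hypothetical names, the decoder is a stand-in, and the thresholds (2.4 and -1.0) are assumed to match the faster-whisper defaults. Returning the last attempt when every temperature fails is a simplification.

```python
import zlib
from typing import Callable, Sequence, Tuple


def compression_ratio(text: str) -> float:
    """Raw size divided by zlib-compressed size; highly repetitive text scores high."""
    data = text.encode("utf-8")
    return len(data) / len(zlib.compress(data))


def decode_with_fallback(
    decode: Callable[[float], Tuple[str, float]],
    temperatures: Sequence[float] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
    compression_ratio_threshold: float = 2.4,  # assumed default
    log_prob_threshold: float = -1.0,  # assumed default
) -> Tuple[str, float, float]:
    """Retry one segment at increasing temperatures until both checks pass.

    `decode` is a hypothetical callable returning (text, avg_logprob) for a
    given temperature. If every temperature fails, the last attempt is returned.
    """
    result = ("", float("-inf"), temperatures[0])
    for temperature in temperatures:
        text, avg_logprob = decode(temperature)
        result = (text, avg_logprob, temperature)
        too_repetitive = compression_ratio(text) > compression_ratio_threshold
        too_unlikely = avg_logprob < log_prob_threshold
        if not (too_repetitive or too_unlikely):
            break  # this attempt is acceptable, stop retrying
    return result


def fake_decode(temperature: float) -> Tuple[str, float]:
    """Stand-in decoder: loops on the same words at temperature 0, recovers above it."""
    if temperature == 0.0:
        return ("the same words " * 40, -0.2)
    return ("Sometimes the transcription gets stuck, but a retry recovers.", -0.3)
```

With a single temperature of 0 there is nothing to fall back to, so the repetitive output is returned as-is, which matches the behavior reported in this issue.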

Fix:

Change the default value for temperature to the same default list used in faster-whisper, so that it handles retries for segments where the compression ratio threshold or log probability threshold criteria are not met.
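
As a rough sketch of the idea behind this change (not the actual diff), the default can be resolved to the fallback list whenever the caller does not pass a temperature. `resolve_temperature` and `DEFAULT_TEMPERATURES` are hypothetical names used only for illustration; the real speaches code may be structured differently.

```python
from typing import Optional, Sequence, Tuple, Union

# Same fallback list that faster-whisper uses by default, as quoted above.
DEFAULT_TEMPERATURES: Tuple[float, ...] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0)


def resolve_temperature(
    requested: Optional[Union[float, Sequence[float]]] = None,
) -> Tuple[float, ...]:
    """Return the temperatures to try for each segment (hypothetical helper).

    - No value given: use the full fallback list so failing segments are retried.
    - A single number: honor it as-is, which disables the fallback.
    - A sequence: use the caller's own fallback schedule.
    """
    if requested is None:
        return DEFAULT_TEMPERATURES
    if isinstance(requested, (int, float)):
        return (float(requested),)
    return tuple(float(t) for t in requested)
```

The resolved value would then be forwarded as the `temperature` argument of the transcription call. An explicit temperature from the client still applies to the whole transcript, so existing callers that set one keep their current behavior.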

@lemeur

lemeur commented Oct 16, 2025

Thanks for the fix. I confirm it resolves the repeated-words issue for us when using Speaches.

@dotmobo

dotmobo commented Oct 27, 2025

Thanks, +1 for the fix here.

@lemeur

lemeur commented Nov 6, 2025

Any news on this PR?
Do you need more information?

@lmorin-inria

+1 for the fix here.
I confirm that (1) I have the problem with large audio files and (2) the patch fixes it.
I would appreciate having the patch integrated into the release.
