- VAD training paradigm reworked;
- Overall slight quality improvement (no metrics update);
- Higher stability on OOD / rare / strange / unique data;
- Significant quality improvements on various known edge cases:
- Unusual voices
- Child voices
- Cartoon voices
- Muted voices
- Muted speech
- Lower quality phone calls
What's Changed
- Fix type hint for min_silence_at_max_speech (float -> int) by @Purfview in #714
- Adamnsandle by @adamnsandle in #717
- Adamnsandle by @adamnsandle in #719
Full Changelog: v6.1...v6.2