❓ Questions / Help / Support #692

chuanmingliu · 2025-10-02T05:14:41Z

Hi,

Is there a comparison of speech-to-non-speech transition latency between Silero VAD v6 and TEN VAD?

Are there any experimental results?

Thanks.

snakers4 · 2025-10-02T05:24:28Z

Maintainer

> TEN VAD rapidly detects speech-to-non-speech transitions, whereas Silero VAD suffers from a delay of several hundred milliseconds, resulting in increased end-to-end latency in human-agent interaction systems. In addition, as demonstrated in the 6.5s-7.0s audio segment, Silero VAD fails to identify short silent durations between adjacent speech segments.

Our window size is 30 ms. Most likely it takes several windows to detect the end of speech. But even a 50-100 ms delay is not a big deal, since this is as accurate as a VAD can get anyway.

We benchmarked TEN VAD, and it does not perform very well on real-life audio; it behaves more like WebRTC, hence the need to highlight this non-issue.
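
To make the window arithmetic above concrete, here is a minimal sketch of how end-of-speech latency follows from the window size and the number of consecutive non-speech windows required before a segment is closed. This is not Silero VAD's actual decision logic or API: the function name, the `threshold` value, and the `min_silence_windows` parameter are all hypothetical and chosen only for illustration. Only the 30 ms window comes from the answer.

```python
WINDOW_MS = 30  # window size quoted in the answer above


def end_of_speech_latency_ms(probs, threshold=0.5, min_silence_windows=3,
                             window_ms=WINDOW_MS):
    """Return (true_end_ms, detected_end_ms, latency_ms) for the first
    speech segment, or None if no end of speech is detected.

    probs is a list of per-window speech probabilities. The threshold and
    min_silence_windows values are illustrative assumptions.
    """
    last_speech = None  # index of the most recent window classified as speech
    silent_run = 0      # consecutive non-speech windows since last_speech
    for i, p in enumerate(probs):
        if p >= threshold:
            last_speech = i
            silent_run = 0
        elif last_speech is not None:
            silent_run += 1
            if silent_run >= min_silence_windows:
                true_end_ms = (last_speech + 1) * window_ms
                detected_end_ms = (i + 1) * window_ms
                return true_end_ms, detected_end_ms, detected_end_ms - true_end_ms
    return None


# Three speech windows followed by silence.
probs = [0.9, 0.9, 0.8, 0.1, 0.1, 0.1, 0.1]
print(end_of_speech_latency_ms(probs))  # (90, 180, 90)
```

Under these assumptions the latency is simply `min_silence_windows * window_ms`, so requiring two or three silent windows lands in the 50-100 ms range mentioned above.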
