-
-
Notifications
You must be signed in to change notification settings - Fork 110
Open
Labels
featureNew feature or requestNew feature or request
Milestone
Description
With pyannote 3.1, we could do only 1 forward pass of the audio instead of num_speakers when extracting embeddings with weights. This is probably at least one of the causes behind the pytorch version of the wespeaker embedding model being that much slower.
This optimization would also reduce the latency of pyannote/embedding so both would need to be re-computed in the README table.
Important: we should verify that this method is also compatible with masking (e.g. in speechbrain embeddings)
Metadata
Metadata
Assignees
Labels
featureNew feature or requestNew feature or request