Optimize weighted embedding extraction with pyannote 3.1

With pyannote 3.1, we could do only 1 forward pass of the audio instead of `num_speakers` when extracting embeddings with weights. This is probably at least one of the causes behind the pytorch version of the wespeaker embedding model being that much slower.

This optimization would also reduce the latency of `pyannote/embedding` so both would need to be re-computed in the README table.

**Important:** we should verify that this method is also compatible with masking (e.g. in speechbrain embeddings)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Optimize weighted embedding extraction with pyannote 3.1 #214

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Uh oh!

Optimize weighted embedding extraction with pyannote 3.1 #214

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions