|
1 | 1 | # Spectral Clustering |
2 | 2 | [](https://github.com/wq2012/SpectralCluster/actions) [](https://pypi.python.org/pypi/spectralcluster) [](https://pypi.org/project/spectralcluster) [](https://pepy.tech/project/spectralcluster) [](https://codecov.io/gh/wq2012/SpectralCluster) [](https://wq2012.github.io/SpectralCluster) |
3 | 3 |
|
4 | | -## Note |
| 4 | +## Overview |
| 5 | + |
| 6 | +This is a Python re-implementation of the spectral clustering and |
| 7 | +constrained spectral clustering algorithms in these two papers: |
5 | 8 |
|
6 | | -We are currently adding new functionalities to this library to include |
7 | | -some algorithms to appear in an upcoming paper. We are updating the APIs as |
8 | | -well. |
| 9 | +* [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/) |
| 10 | +* [Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection](https://arxiv.org/abs/2109.11641) |
| 11 | + |
| 12 | + |
| 13 | + |
| 14 | +## Notice |
| 15 | + |
| 16 | +We recently added new functionalities to this library to include |
| 17 | + algorithms in a [new paper](https://arxiv.org/abs/2109.11641). We updated the APIs as well. |
9 | 18 |
|
10 | 19 | If you depend on our old API, please use an **older version** of this library: |
11 | 20 | ``` |
12 | 21 | pip3 install spectralcluster==0.1.0 |
13 | 22 | ``` |
14 | 23 |
|
15 | | -## Overview |
16 | | - |
17 | | -This is a Python re-implementation of the spectral clustering algorithm in the |
18 | | -paper [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/). |
19 | | - |
20 | | - |
21 | | - |
22 | 24 | ## Disclaimer |
23 | 25 |
|
24 | 26 | **This is not a Google product.** |
@@ -49,7 +51,8 @@ python3 -m pip install spectralcluster |
49 | 51 |
|
50 | 52 | Simply use the `predict()` method of class `SpectralClusterer` to perform |
51 | 53 | spectral clustering. The example below should be closest to the original C++ |
52 | | -implemention used my our [ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/). |
| 54 | +implemention used our |
| 55 | +[ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/). |
53 | 56 |
|
54 | 57 | ```python |
55 | 58 | from spectralcluster import configs |
@@ -163,7 +166,9 @@ For the complete list of parameters of `AutoTune`, see |
163 | 166 |
|
164 | 167 | ### Constrained spectral clustering |
165 | 168 |
|
166 | | -We also implemented 2 constrained spectral clustering methods: |
| 169 | +In the [Turn-to-Diarize paper](https://arxiv.org/abs/2109.11641), |
| 170 | +the spectral clustering is constrained by speaker turns. |
| 171 | +We implemented two constrained spectral clustering methods: |
167 | 172 |
|
168 | 173 | * Affinity integration. |
169 | 174 | * Constraint propagation (see paper [[1](https://link.springer.com/chapter/10.1007/978-3-642-15567-3_1)] and [[2](https://arxiv.org/abs/1109.4684)]). |
@@ -203,22 +208,29 @@ constraint_matrix = constraint.ConstraintMatrix( |
203 | 208 |
|
204 | 209 | ## Citations |
205 | 210 |
|
206 | | -Our paper is cited as: |
| 211 | +Our papers are cited as: |
207 | 212 |
|
208 | 213 | ``` |
209 | 214 | @inproceedings{wang2018speaker, |
210 | | - title={Speaker diarization with lstm}, |
| 215 | + title={{Speaker Diarization with LSTM}}, |
211 | 216 | author={Wang, Quan and Downey, Carlton and Wan, Li and Mansfield, Philip Andrew and Moreno, Ignacio Lopz}, |
212 | 217 | booktitle={2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, |
213 | 218 | pages={5239--5243}, |
214 | 219 | year={2018}, |
215 | 220 | organization={IEEE} |
216 | 221 | } |
| 222 | +
|
| 223 | +@article{xia2021turn, |
| 224 | + title={{Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection}}, |
| 225 | + author={Wei Xia and Han Lu and Quan Wang and Anshuman Tripathi and Ignacio Lopez Moreno and Hasim Sak}, |
| 226 | + journal={arXiv preprint arXiv:2109.11641}, |
| 227 | + year={2021} |
| 228 | +} |
217 | 229 | ``` |
218 | 230 |
|
219 | 231 | ## Misc |
220 | 232 |
|
221 | | -Our new speaker diarization systems are now fully supervised, powered by |
| 233 | +We also have fully supervised speaker diarization systems, powered by |
222 | 234 | [uis-rnn](https://github.com/google/uis-rnn). |
223 | 235 | Check this [Google AI Blog](https://ai.googleblog.com/2018/11/accurate-online-speaker-diarization.html). |
224 | 236 |
|
|
0 commit comments