Bayesian HMM Clustering Of X-vector Sequences (VBx) In Speaker Diarization: Theory, Implementation And Analysis On Standard Tasks
2020 · Federico Landini, Ján Profant, Mireia Diez, et al.
Abstract
The recently proposed VBx diarization method uses a Bayesian hidden Markov model to find speaker clusters in a sequence of x-vectors. In this work we perform an extensive comparison of the performance of VBx diarization with other approaches in the literature, and we show that VBx achieves superior performance on three of the most popular datasets for evaluating diarization: CALLHOME, AMI and DIHARD II. Further, we present for the first time the derivation and update formulae for the VBx model, focusing on the efficiency and simplicity of this model as compared to the previous and more complex BHMM model working on frame-by-frame standard cepstral features. Together with this publication, we release the recipe for training the x-vector extractors used in our experiments on both wide and narrowband data, and the VBx recipes that attain state-of-the-art performance on all three datasets. Besides, we point out the lack of a standardized evaluation protocol for the AMI dataset and we pr
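The general idea of treating speakers as HMM states over a sequence of x-vectors can be illustrated with a toy sketch. This is not the VBx inference from the paper (which derives variational Bayes updates); it is a simplified Viterbi decode over a "sticky" HMM, under several assumptions that are not in the source: speaker means are already known, emissions are isotropic Gaussians, and the function name `sticky_hmm_decode` and the parameters `p_loop` and `var` are hypothetical.

```python
import numpy as np


def sticky_hmm_decode(x, means, p_loop=0.9, var=1.0):
    """Toy Viterbi decode of speaker labels for a sequence of embeddings.

    Assumptions (illustrative only, not the VBx algorithm):
      - `means` holds one known mean vector per speaker (at least 2 speakers),
      - emissions are isotropic Gaussians with shared variance `var`,
      - 0 < p_loop < 1 is the probability of staying with the same speaker.
    """
    x = np.asarray(x, dtype=float)
    means = np.asarray(means, dtype=float)
    n_frames, n_spk = x.shape[0], means.shape[0]

    # Log-likelihood of every embedding under every speaker model.
    # Constant terms are dropped because they do not change the argmax.
    sq_dist = ((x[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
    log_lik = -0.5 * sq_dist / var

    # Transitions: stay with probability p_loop, else switch uniformly.
    trans = np.full((n_spk, n_spk), (1.0 - p_loop) / (n_spk - 1))
    np.fill_diagonal(trans, p_loop)
    log_trans = np.log(trans)

    # Forward pass: best score of any path ending in each speaker.
    score = np.log(1.0 / n_spk) + log_lik[0]
    back = np.zeros((n_frames, n_spk), dtype=int)
    for t in range(1, n_frames):
        cand = score[:, None] + log_trans  # cand[i, j]: move from i to j
        back[t] = cand.argmax(axis=0)
        score = cand.max(axis=0) + log_lik[t]

    # Backward pass: follow the stored pointers from the best final state.
    path = np.empty(n_frames, dtype=int)
    path[-1] = score.argmax()
    for t in range(n_frames - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    return path


if __name__ == "__main__":
    # Two made-up speakers and seven made-up 2-D "embeddings".
    means = [[0.0, 0.0], [5.0, 5.0]]
    x = [[0.1, 0.0], [0.2, -0.1], [2.6, 2.6], [0.0, 0.1],
         [5.0, 5.1], [4.9, 5.0], [5.2, 4.8]]
    print(sticky_hmm_decode(x, means, p_loop=0.99))
    print(sticky_hmm_decode(x, means, p_loop=0.5))
```

With `p_loop=0.99` the ambiguous third vector stays with the first speaker, because two speaker changes cost more than the small likelihood gain. With `p_loop=0.5` the transitions are uniform, so the decode reduces to a per-vector nearest-mean assignment and the third vector flips to the second speaker.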
Related papers
- Discriminative Training Of VBx Diarization (2023) · 5.84
- Multi-stream Extension Of Variational Bayesian HMM Clustering (MS-VBx) For Combined End-to-end And Vector Clustering-based Diarization (2023) · 0.00
- Bayesian X-vector: Bayesian Neural Network Based X-vector System For Speaker Verification (2020) · 6.77
- Speaker Diarization Using Two-pass Leave-one-out Gaussian PLDA Clustering Of DNN Embeddings (2021) · 2.26
- The HUAWEI Speaker Diarisation System For The VoxCeleb Speaker Diarisation Challenge (2020) · 0.00
- A Study Of Semi-supervised Speaker Diarization System Using GAN Mixture Model (2019) · 0.00
- Target-speaker Voice Activity Detection: A Novel Approach For Multi-speaker Diarization In A Dinner Party Scenario (2020) · 16.19
- Analysis Of The BUT Diarization System For VoxConverse Challenge (2020) · 8.82