Simultaneous Diarization And Separation Of Meetings Through The Integration Of Statistical Mixture Models
2024 · Tobias Cord-Landwehr, Christoph Boeddeker, Reinhold Haeb-Umbach
Abstract
We propose an approach for simultaneous diarization and separation of meeting data. It consists of a complex Angular Central Gaussian Mixture Model (cACGMM) for speech source separation, and a von-Mises-Fisher Mixture Model (VMFMM) for diarization in a joint statistical framework. Through the integration, both spatial and spectral information are exploited for diarization and separation. We also develop a method for counting the number of active speakers in a segment of a meeting to support block-wise processing. While the total number of speakers in a meeting may be known, it is usually not known on a per-segment level. With the proposed speaker counting, joint diarization and source separation can be done segment-by-segment, and the permutation problem across segments is solved, thus allowing for block-online processing in the future. Experimental results on the LibriCSS meeting corpus show that the integrated approach outperforms a cascaded approach of diarization and speech enhance
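The diarization half of the model described above can be illustrated with a small expectation-maximization loop for a von Mises-Fisher mixture. This is only a minimal sketch under stated assumptions, not the authors' implementation: it assumes unit-norm feature vectors (for example, speaker embeddings), a single concentration parameter `kappa` shared by all classes, and fixed uniform priors. The function names, the toy data, and the initialization are hypothetical, and the coupling with the cACGMM is left out.

```python
import numpy as np


def vmf_responsibilities(embeddings, means, kappa, log_priors):
    """E-step of a vMF mixture with one shared concentration kappa (assumption).

    embeddings: (N, D) unit-norm vectors
    means:      (K, D) unit-norm mean directions
    log_priors: (K,) log mixture weights
    Returns the (N, K) class posteriors.
    """
    # With a shared kappa the vMF normalizer is identical for every class and
    # cancels in the posterior, so only kappa * mu^T x remains in the exponent.
    logits = kappa * embeddings @ means.T + log_priors
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    post = np.exp(logits)
    return post / post.sum(axis=1, keepdims=True)


def update_means(embeddings, post):
    """M-step for the mean directions: normalized posterior-weighted sums."""
    sums = post.T @ embeddings  # (K, D)
    return sums / np.linalg.norm(sums, axis=1, keepdims=True)


rng = np.random.default_rng(0)

# Toy data: two clusters of noisy unit vectors standing in for embeddings.
centers = np.array([[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]])
x = np.repeat(centers, 50, axis=0) + 0.1 * rng.standard_normal((100, 3))
x /= np.linalg.norm(x, axis=1, keepdims=True)

means = x[[0, 99]]  # hypothetical initialization: one sample from each cluster
log_priors = np.log(np.full(2, 0.5))  # uniform priors, kept fixed here

for _ in range(10):
    post = vmf_responsibilities(x, means, kappa=20.0, log_priors=log_priors)
    means = update_means(x, post)

labels = post.argmax(axis=1)  # per-vector speaker label
```

In this sketch the posteriors play the role of a per-vector speaker activity estimate. In an integrated model of the kind the abstract describes, the same class posteriors would additionally be informed by the spatial observations.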
Related papers
- Integration Of Speech Separation, Diarization, And Recognition For Multi-speaker Meetings: System Description, Comparison, And Analysis (2020) 13.23
- An Initialization Scheme For Meeting Separation With Spatial Mixture Models (2022) 7.16
- TS-SEP: Joint Diarization And Separation Conditioned On Estimated Speaker Embeddings (2023) 10.35
- Multi-microphone Automatic Speech Segmentation In Meetings Based On Circular Harmonics Features (2023) 0.00
- Incorporating Spatial Cues In Modular Speaker Diarization For Multi-channel Multi-party Meetings (2024) 4.52
- Unified Modeling Of Multi-talker Overlapped Speech Recognition And Diarization With A Sidecar Separator (2023) 7.50
- Meeting Recognition With Continuous Speech Separation And Transcription-supported Diarization (2023) 6.77
- All-neural Online Source Separation, Counting, And Diarization For Meeting Analysis (2019) 13.05