Dover-lap: A Method For Combining Overlap-aware Diarization Outputs
2020 Β· Desh Raj, Leibny Paola Garcia-Perera, Zili Huang, et al.
Abstract
Several advances have been made recently towards handling overlapping speech for speaker diarization. Since speech and natural language tasks often benefit from ensemble techniques, we propose an algorithm for combining outputs from such diarization systems through majority voting. Our method, DOVER-Lap, is inspired from the recently proposed DOVER algorithm, but is designed to handle overlapping segments in diarization outputs. We also modify the pair-wise incremental label mapping strategy used in DOVER, and propose an approximation algorithm based on weighted k-partite graph matching, which performs this mapping using a global cost tensor. We demonstrate the strength of our method by combining outputs from diverse systems -- clustering-based, region proposal networks, and target-speaker voice activity detection -- on AMI and LibriCSS datasets, where it consistently outperforms the single best system. Additionally, we show that DOVER-Lap can be used for late fusion in multichannel di
Authors
(none)
Tags
Stats
Related papers
- DOVER: A Method For Combining Diarization Outputs (2019)8.60
- Improving Diarization Robustness Using Diversification, Randomization And The DOVER Algorithm (2019)0.00
- Overlap-aware Diarization: Resegmentation Using Neural End-to-end Overlapped Speech Detection (2019)13.17
- Multi-class Spectral Clustering With Overlaps For Speaker Diarization (2020)10.35
- Overlap-aware Low-latency Online Speaker Diarization Based On End-to-end Local Segmentation (2021)10.35
- Probabilistic Fusion And Calibration Of Neural Speaker Diarization Models (2025)0.00
- End-to-end Speaker Diarization As Post-processing (2020)11.08
- Channel-combination Algorithms For Robust Distant Voice Activity And Overlapped Speech Detection (2024)6.34