Overlap-aware Diarization: Resegmentation Using Neural End-to-end Overlapped Speech Detection
2019 · Latané Bullock, Hervé Bredin, Leibny Paola Garcia-Perera
Abstract
We address the problem of effectively handling overlapping speech in a diarization system. First, we detail a neural Long Short-Term Memory-based architecture for overlap detection. Secondly, detected overlap regions are exploited in conjunction with a frame-level speaker posterior matrix to make two-speaker assignments for overlapped frames in the resegmentation step. The overlap detection module achieves state-of-the-art performance on the AMI, DIHARD, and ETAPE corpora. We apply overlap-aware resegmentation on AMI, resulting in a 20% relative DER reduction over the baseline system. While this approach is by no means an end-all solution to overlap-aware diarization, it reveals promising directions for handling overlap.
Authors
(none)
Tags
Stats
Related papers
- Multi-class Spectral Clustering With Overlaps For Speaker Diarization (2020)10.35
- Speaker Embedding-aware Neural Diarization: An Efficient Framework For Overlapping Speech Diarization In Meeting Scenarios (2022)0.00
- Overlap-aware Low-latency Online Speaker Diarization Based On End-to-end Local Segmentation (2021)10.35
- Leveraging Speaker Embeddings In End-to-end Neural Diarization For Two-speaker Scenarios (2024)0.00
- Dover-lap: A Method For Combining Overlap-aware Diarization Outputs (2020)11.76
- End-to-end Speaker Diarization As Post-processing (2020)11.08
- Once More Diarization: Improving Meeting Transcription Systems Through Segment-level Speaker Reassignment (2024)5.24
- End-to-end Speaker Diarization Conditioned On Speech Activity And Overlap Detection (2021)8.82