Semi-supervised Training With Pseudo-labeling For End-to-end Neural Diarization
2021 Β· Yuki Takashima, Yusuke Fujita, Shota Horiguchi, et al.
Abstract
In this paper, we present a semi-supervised training technique using pseudo-labeling for end-to-end neural diarization (EEND). The EEND system has shown promising performance compared with traditional clustering-based methods, especially in the case of overlapping speech. However, to get a well-tuned model, EEND requires labeled data for all the joint speech activities of every speaker at each time frame in a recording. In this paper, we explore a pseudo-labeling approach that employs unlabeled data. First, we propose an iterative pseudo-label method for EEND, which trains the model using unlabeled data of a target condition. Then, we also propose a committee-based training method to improve the performance of EEND. To evaluate our proposed method, we conduct the experiments of model adaptation using labeled and unlabeled data. Experimental results on the CALLHOME dataset show that our proposed pseudo-label achieved a 37.4% relative diarization error rate reduction compared to a seed m
Authors
(none)
Tags
Stats
Related papers
- End-to-end Neural Diarization: Reformulating Speaker Diarization As Simple Multi-label Classification (2020)0.00
- End-to-end Speaker Diarization Conditioned On Speech Activity And Overlap Detection (2021)8.82
- Advances In Integration Of End-to-end Neural And Clustering-based Diarization For Real Conversational Speech (2021)16.48
- Speech-aware Neural Diarization With Encoder-decoder Attractor Guided By Attention Constraints (2024)0.00
- Towards Word-level End-to-end Neural Speaker Diarization With Auxiliary Network (2023)0.00
- Integrating End-to-end Neural And Clustering-based Diarization: Getting The Best Of Both Worlds (2020)13.74
- Probabilistic Fusion And Calibration Of Neural Speaker Diarization Models (2025)0.00
- Diaper: End-to-end Neural Diarization With Perceiver-based Attractors (2023)9.59