Channel Adaptation For Speaker Verification Using Optimal Transport With Pseudo Label
2024 Β· Wenhao Yang, Jianguo Wei, Wenhuan Lu, et al.
Abstract
Domain gap often degrades the performance of speaker verification (SV) systems when the statistical distributions of training data and real-world test speech are mismatched. Channel variation, a primary factor causing this gap, is less addressed than other issues (e.g., noise). Although various domain adaptation algorithms could be applied to handle this domain gap problem, most algorithms could not take the complex distribution structure in domain alignment with discriminative learning. In this paper, we propose a novel unsupervised domain adaptation method, i.e., Joint Partial Optimal Transport with Pseudo Label (JPOT-PL), to alleviate the channel mismatch problem. Leveraging the geometric-aware distance metric of optimal transport in distribution alignment, we further design a pseudo label-based discriminative learning where the pseudo label can be regarded as a new type of soft speaker label derived from the optimal coupling. With the JPOT-PL, we carry out experiments on the SV cha
Authors
(none)
Tags
Stats
Related papers
- Neural Domain Alignment For Spoken Language Recognition Based On Optimal Transport (2023)0.00
- Optimal Transport-based Adaptation In Dysarthric Speech Tasks (2021)0.00
- Speaker Verification Using End-to-end Adversarial Language Adaptation (2018)11.19
- Source -free Domain Adaptation For Speaker Verification In Data-scarce Languages And Noisy Channels (2024)0.00
- Interpretable Dysarthric Speaker Adaptation Based On Optimal-transport (2022)2.26
- Unsupervised Noise Adaptive Speech Enhancement By Discriminator-constrained Optimal Transport (2021)0.00
- Multi-domain Adaptation By Self-supervised Learning For Speaker Verification (2023)0.00
- Progressive Sub-graph Clustering Algorithm For Semi-supervised Domain Adaptation Speaker Verification (2023)0.00