Learning Speaker Representation With Semi-supervised Learning Approach For Speaker Profiling
2021 Β· Shangeth Rajaa, Pham van Tung, Chng Eng Siong
Abstract
Speaker profiling, which aims to estimate speaker characteristics such as age and height, has a wide range of applications inforensics, recommendation systems, etc. In this work, we propose a semisupervised learning approach to mitigate the issue of low training data for speaker profiling. This is done by utilizing external corpus with speaker information to train a better representation which can help to improve the speaker profiling systems. Specifically, besides the standard supervised learning path, the proposed framework has two more paths: (1) an unsupervised speaker representation learning path that helps to capture the speaker information; (2) a consistency training path that helps to improve the robustness of the system by enforcing it to produce similar predictions for utterances of the same speaker.The proposed approach is evaluated on the TIMIT and NISP datasets for age, height, and gender estimation, while the Librispeech is used as the unsupervised external corpus. Traine
Authors
(none)
Tags
Stats
Related papers
- Graph-based Label Propagation For Semi-supervised Speaker Identification (2021)8.09
- TIMIT Speaker Profiling: A Comparison Of Multi-task Learning And Single-task Learning Approaches (2024)0.00
- Curriculum Learning For Self-supervised Speaker Verification (2022)8.09
- Semi-supervised Learning For Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation (2020)5.24
- SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning (2022)2.26
- Estimation Of Speaker Age And Height From Speech Signal Using Bi-encoder Transformer Mixture Model (2022)8.09
- Overview Of Speaker Modeling And Its Applications: From The Lens Of Deep Speaker Representation Learning (2024)10.74
- Leveraging Speaker Attribute Information Using Multi Task Learning For Speaker Verification And Diarization (2020)6.34