Angular Softmax Loss For End-to-end Speaker Verification
2018 Β· Yutian Li, Feng Gao, Zhijian Ou, et al.
Abstract
End-to-end speaker verification systems have received increasing interests. The traditional i-vector approach trains a generative model (basically a factor-analysis model) to extract i-vectors as speaker embeddings. In contrast, the end-to-end approach directly trains a discriminative model (often a neural network) to learn discriminative speaker embeddings; a crucial component is the training criterion. In this paper, we use angular softmax (A-softmax), which is originally proposed for face verification, as the loss function for feature learning in end-to-end speaker verification. By introducing margins between classes into softmax loss, A-softmax can learn more discriminative features than softmax loss and triplet loss, and at the same time, is easy and stable for usage. We make two contributions in this work. 1) We introduce A-softmax loss into end-to-end speaker verification and achieve significant EER reductions. 2) We find that the combination of using A-softmax in training the f
Authors
(none)
Tags
Stats
Related papers
- Large Margin Softmax Loss For Speaker Verification (2019)14.66
- A Study On Angular Based Embedding Learning For Text-independent Speaker Verification (2019)2.26
- Adapting End-to-end Neural Speaker Verification To New Languages And Recording Conditions With Adversarial Training (2018)9.59
- A Comparison Of Metric Learning Loss Functions For End-to-end Speaker Verification (2020)6.77
- Experimenting With Additive Margins For Contrastive Self-supervised Speaker Verification (2023)4.52
- End-to-end Losses Based On Speaker Basis Vectors And All-speaker Hard Negative Mining For Speaker Verification (2019)0.00
- Generalized End-to-end Loss For Speaker Verification (2017)20.58
- Discriminative Speaker Representation Via Contrastive Learning With Class-aware Attention In Angular Space (2022)8.60