Parameter-free Attentive Scoring For Speaker Verification
2022 · Jason Pelecanos, Quan Wang, Yiling Huang, et al.
Abstract
This paper presents a novel study of parameter-free attentive scoring for speaker verification. Parameter-free scoring provides the flexibility of comparing speaker representations without the need for an accompanying parametric scoring model. Inspired by the attention component in Transformer neural networks, we propose a variant of the scaled dot-product attention mechanism to compare enrollment and test segment representations. In addition, this work explores the effect on performance of (i) different types of normalization, (ii) independent versus tied query/key estimation, (iii) varying the number of key-value pairs, and (iv) pooling multiple enrollment utterance statistics. Experimental results averaged over four tasks show that a simple parameter-free attentive scoring mechanism can improve the average EER by 10% over the best cosine similarity baseline.
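The abstract only names the mechanism, so the following is a minimal sketch of what parameter-free attentive scoring could look like, not the authors' exact formulation. It assumes each utterance is represented by a small set of key/query and value vectors, that attention weights come from a softmax over scaled dot products between test queries and enrollment keys, and that the final score is the attention-weighted average of cosine similarities between value vectors. The function name `attentive_score`, the joint softmax over all pairs, and the `1/sqrt(d)` scale are illustrative assumptions.

```python
import numpy as np


def softmax(x):
    """Numerically stable softmax over all entries of x."""
    e = np.exp(x - np.max(x))
    return e / e.sum()


def attentive_score(enroll_keys, enroll_values, test_queries, test_values):
    """Hypothetical parameter-free attentive score (illustrative only).

    enroll_keys:   (K_e, d)  keys from the enrollment representation
    enroll_values: (K_e, m)  values from the enrollment representation
    test_queries:  (K_t, d)  queries from the test representation
    test_values:   (K_t, m)  values from the test representation
    """
    d = enroll_keys.shape[1]
    # Scaled dot-product attention logits between test queries and enrollment keys.
    logits = (test_queries @ enroll_keys.T) / np.sqrt(d)
    # Assumption: one softmax over all query/key pairs, so the weights sum to 1.
    weights = softmax(logits)
    # Cosine similarity between every test value and every enrollment value.
    ev = enroll_values / np.linalg.norm(enroll_values, axis=1, keepdims=True)
    tv = test_values / np.linalg.norm(test_values, axis=1, keepdims=True)
    sims = tv @ ev.T
    # No trainable parameters are involved: the score is a weighted average.
    return float(np.sum(weights * sims))


# Usage with random stand-in representations (not real speaker data).
rng = np.random.default_rng(0)
score = attentive_score(
    rng.normal(size=(4, 8)), rng.normal(size=(4, 16)),
    rng.normal(size=(4, 8)), rng.normal(size=(4, 16)),
)
```

Because the weights are non-negative and sum to one, this sketch's score stays within the cosine range of -1 to 1, which keeps it directly comparable to a plain cosine similarity baseline.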
Related papers
- Phonetic-attention Scoring For Deep Speaker Features In Speaker Verification (2018) · 2.26
- An Attention-based Backend Allowing Efficient Fine-tuning Of Transformer Models For Speaker Verification (2022) · 11.08
- Self-attentive Multi-layer Aggregation With Feature Recalibration And Normalization For End-to-end Speaker Verification System (2020) · 0.00
- Neural Scoring: A Refreshed End-to-end Approach For Speaker Recognition In Complex Conditions (2024) · 0.00
- Attentive Statistics Pooling For Deep Speaker Embedding (2018) · 18.88
- Frequency And Multi-scale Selective Kernel Attention For Speaker Verification (2022) · 10.07
- Parameterized Channel Normalization For Far-field Deep Speaker Verification (2021) · 3.58
- A Discriminative Condition-aware Backend For Speaker Verification (2019) · 6.34