Text-independent Speaker Verification Using Long Short-term Memory Networks
2018 Β· Aryan Mobiny, Mohammad Najarian
Abstract
In this paper, an architecture based on Long Short-Term Memory Networks has been proposed for the text-independent scenario which is aimed to capture the temporal speaker-related information by operating over traditional speech features. For speaker verification, at first, a background model must be created for speaker representation. Then, in enrollment stage, the speaker models will be created based on the enrollment utterances. For this work, the model will be trained in an end-to-end fashion to combine the first two stages. The main goal of end-to-end training is the model being optimized to be consistent with the speaker verification protocol. The end- to-end training jointly learns the background and speaker models by creating the representation space. The LSTM architecture is trained to create a discrimination space for validating the match and non-match pairs for speaker verification. The proposed architecture demonstrate its superiority in the text-independent compared to othe
Authors
(none)
Tags
Stats
Related papers
- End-to-end Trainable Self-attentive Shallow Network For Text-independent Speaker Verification (2020)0.00
- Speaker Verification Using Convolutional Neural Networks (2018)0.00
- Rsknet-mtsp: Effective And Portable Deep Architecture For Speaker Verification (2021)9.03
- MFA: TDNN With Multi-scale Frequency-channel Attention For Text-independent Speaker Verification With Short Utterances (2022)13.79
- Deep Neural Network Based I-vector Mapping For Speaker Verification Using Short Utterances (2018)0.00
- End-to-end Attention Based Text-dependent Speaker Verification (2017)14.87
- Lstmse-net: Long Short Term Speech Enhancement Network For Audio-visual Speech Enhancement (2024)8.57
- Deep Speaker Feature Learning For Text-independent Speaker Verification (2017)12.54