Optimization Of Data-driven Filterbank For Automatic Speaker Verification
2020 Β· Susanta Sarangi, Md Sahidullah, Goutam Saha
Abstract
Most of the speech processing applications use triangular filters spaced in mel-scale for feature extraction. In this paper, we propose a new data-driven filter design method which optimizes filter parameters from a given speech data. First, we introduce a frame-selection based approach for developing speech-signal-based frequency warping scale. Then, we propose a new method for computing the filter frequency responses by using principal component analysis (PCA). The main advantage of the proposed method over the recently introduced deep learning based methods is that it requires very limited amount of unlabeled speech-data. We demonstrate that the proposed filterbank has more speaker discriminative power than commonly used mel filterbank as well as existing data-driven filterbank. We conduct automatic speaker verification (ASV) experiments with different corpora using various classifier back-ends. We show that the acoustic features created with proposed filterbank are better than exis
Authors
(none)
Tags
Stats
Related papers
- Deepvox: Discovering Features From Raw Audio For Speaker Recognition In Non-ideal Audio Signals (2020)0.00
- Learnable Frequency Filters For Speech Feature Extraction In Speaker Verification (2022)0.00
- Detection Of Doctored Speech: Towards An End-to-end Parametric Learn-able Filter Approach (2022)0.00
- Filterbank Design For End-to-end Speech Separation (2019)12.17
- Unsupervised Feature Enhancement For Speaker Verification (2019)5.84
- Application Of ASV For Voice Identification After VC And Duration Predictor Improvement In TTS Models (2024)0.00
- Neural Network Based Speaker Classification And Verification Systems With Enhanced Features (2017)8.60
- A Unified Deep Learning Framework For Short-duration Speaker Verification In Adverse Environments (2020)9.41