A Novel Speech Feature Fusion Algorithm For Text-independent Speaker Recognition
2022 Β· Biao Ma, Chengben Xu, Ye Zhang
Abstract
A novel speech feature fusion algorithm with independent vector analysis (IVA) and parallel convolutional neural network (PCNN) is proposed for text-independent speaker recognition. Firstly, some different feature types, such as the time domain (TD) features and the frequency domain (FD) features, can be extracted from a speaker's speech, and the TD and the FD features can be considered as the linear mixtures of independent feature components (IFCs) with an unknown mixing system. To estimate the IFCs, the TD and the FD features of the speaker's speech are concatenated to build the TD and the FD feature matrix, respectively. Then, a feature tensor of the speaker's speech is obtained by paralleling the TD and the FD feature matrix. To enhance the dependence on different feature types and remove the redundancies of the same feature type, the independent vector analysis (IVA) can be used to estimate the IFC matrices of TD and FD features with the feature tensor. The IFC matrices are utiliz
Authors
(none)
Tags
Stats
Related papers
- P-vectors: A Parallel-coupled Tdnn/transformer Network For Speaker Verification (2023)5.84
- Fusion Of Embeddings Networks For Robust Combination Of Text Dependent And Independent Speaker Recognition (2021)4.52
- PCA/LDA Approach For Text-independent Speaker Recognition (2016)5.24
- Frequency And Temporal Convolutional Attention For Text-independent Speaker Recognition (2019)0.00
- A Text-independent Speaker Verification Model: A Comparative Analysis (2017)8.60
- Target Speech Extraction: Independent Vector Extraction Guided By Supervised Speaker Identification (2021)8.09
- FDN: Finite Difference Network With Hierarchical Convolutional Features For Text-independent Speaker Verification (2021)0.00
- Joint Speaker Features Learning For Audio-visual Multichannel Speech Separation And Recognition (2024)0.00