Detection Of Doctored Speech: Towards An End-to-end Parametric Learn-able Filter Approach
2022 · Rohit Arora
Abstract
Automatic Speaker Verification (ASV) systems have potential in biometric applications for logical access control and authentication. Much is at stake if an ASV system is compromised. The preliminary work presents a comparative analysis of the wavelet-based and MFCC-based state-of-the-art spoof detection techniques developed in (Novoselov et al., 2016) and (Alam et al., 2016a), respectively. The results on ASVspoof 2015 justify our inclination towards wavelet-based features over MFCC features. Experiments on the ASVspoof 2019 database expose the limited reliability of traditional handcrafted features and give us further reason to move towards end-to-end (E2E) deep neural networks and more recent techniques. We use the SincNet architecture as our baseline. Replacing the Sinc layer with Wavelet Scattering and Continuous Wavelet Transform layers yields E2E deep learning models that we call WSTnet and CWTnet, respectively. The fusion model achieved
Related papers
- Automatic Speaker Verification Spoofing And Deepfake Detection Using Wav2vec 2.0 And Data Augmentation (2022) 17.35
- Securing Voice Biometrics: One-shot Learning Approach For Audio Deepfake Detection (2023) 9.03
- Representation Selective Self-distillation And Wav2vec 2.0 Feature Exploration For Spoof-aware Speaker Verification (2022) 9.03
- Experimental Study: Enhancing Voice Spoofing Detection Models With Wav2vec 2.0 (2024) 0.00
- A Study On Convolutional Neural Network Based End-to-end Replay Anti-spoofing (2018) 0.00
- Exploring WavLM Back-ends For Speech Spoofing And Deepfake Detection (2024) 4.52
- Anti-spoofing Methods For Automatic Speaker Verification System (2017) 2.26
- Combining Automatic Speaker Verification And Prosody Analysis For Synthetic Speech Detection (2022) 10.48