Impact Of Phonetics On Speaker Identity In Adversarial Voice Attack
2025 Β· Daniyal Kabir Dar, Qiben Yan, Li Xiao, et al.
Abstract
Adversarial perturbations in speech pose a serious threat to automatic speech recognition (ASR) and speaker verification by introducing subtle waveform modifications that remain imperceptible to humans but can significantly alter system outputs. While targeted attacks on end-to-end ASR models have been widely studied, the phonetic basis of these perturbations and their effect on speaker identity remain underexplored. In this work, we analyze adversarial audio at the phonetic level and show that perturbations exploit systematic confusions such as vowel centralization and consonant substitutions. These distortions not only mislead transcription but also degrade phonetic cues critical for speaker verification, leading to identity drift. Using DeepSpeech as our ASR target, we generate targeted adversarial examples and evaluate their impact on speaker embeddings across genuine and impostor samples. Results across 16 phonetically diverse target phrases demonstrate that adversarial audio indu
Authors
(none)
Tags
Stats
Related papers
- Inaudible Adversarial Perturbations For Targeted Attack In Speaker Recognition (2020)12.33
- Attacking Voice Anonymization Systems With Augmented Feature And Speaker Identity Difference (2024)6.34
- Adversarial Attacks Against Automatic Speech Recognition Systems Via Psychoacoustic Hiding (2018)16.45
- Diffattack: Diffusion-based Timbre-reserved Adversarial Attack In Speaker Identification (2025)0.00
- Privacy-utility Balanced Voice De-identification Using Adversarial Examples (2022)0.00
- Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding (2024)7.16
- Transforming Acoustic Characteristics To Deceive Playback Spoofing Countermeasures Of Speaker Verification Systems (2018)6.34
- Adversarial Attack And Defense Strategies For Deep Speaker Recognition Systems (2020)13.39