Adversarial Speech For Voice Privacy Protection From Personalized Speech Generation
2024 Β· Shihao Chen, Liping Chen, Jie Zhang, et al.
Abstract
The rapid progress in personalized speech generation technology, including personalized text-to-speech (TTS) and voice conversion (VC), poses a challenge in distinguishing between generated and real speech for human listeners, resulting in an urgent demand in protecting speakers' voices from malicious misuse. In this regard, we propose a speaker protection method based on adversarial attacks. The proposed method perturbs speech signals by minimally altering the original speech while rendering downstream speech generation models unable to accurately generate the voice of the target speaker. For validation, we employ the open-source pre-trained YourTTS model for speech generation and protect the target speaker's speech in the white-box scenario. Automatic speaker verification (ASV) evaluations were carried out on the generated speech as the assessment of the voice protection capability. Our experimental results show that we successfully perturbed the speaker encoder of the YourTTS model
Authors
(none)
Tags
Stats
Related papers
- Safespeech: Robust And Universal Voice Protection Against Malicious Speech Synthesis (2025)0.00
- Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding (2024)7.16
- Vsmask: Defending Against Voice Synthesis Attack Via Real-time Predictive Perturbation (2023)7.81
- Privacy-utility Balanced Voice De-identification Using Adversarial Examples (2022)0.00
- Ghostvec: A New Threat To Speaker Privacy Of End-to-end Speech Recognition System (2023)0.00
- Inaudible Adversarial Perturbations For Targeted Attack In Speaker Recognition (2020)12.33
- Impact Of Phonetics On Speaker Identity In Adversarial Voice Attack (2025)0.00
- One-class Learning Towards Synthetic Voice Spoofing Detection (2020)17.31