Can We Steal Your Vocal Identity From The Internet?: Initial Investigation Of Cloning Obama's Voice Using GAN, Wavenet And Low-quality Found Data
2018 Β· Jaime Lorenzo-Trueba, Fuming Fang, Xin Wang, et al.
Abstract
Thanks to the growing availability of spoofing databases and rapid advances in using them, systems for detecting voice spoofing attacks are becoming more and more capable, and error rates close to zero are being reached for the ASVspoof2015 database. However, speech synthesis and voice conversion paradigms that are not considered in the ASVspoof2015 database are appearing. Such examples include direct waveform modelling and generative adversarial networks. We also need to investigate the feasibility of training spoofing systems using only low-quality found data. For that purpose, we developed a generative adversarial network-based speech enhancement system that improves the quality of speech data found in publicly available sources. Using the enhanced data, we trained state-of-the-art text-to-speech and voice conversion models and evaluated them in terms of perceptual speech quality and speaker similarity. The results show that the enhancement models significantly improved the SNR of l
Authors
(none)
Tags
Stats
Related papers
- Securing Voice-driven Interfaces Against Fake (cloned) Audio Attacks (2019)9.92
- Neural Voice Cloning With A Few Samples (2018)0.00
- One-class Learning Towards Synthetic Voice Spoofing Detection (2020)17.31
- Spoofed Training Data For Speech Spoofing Countermeasure Can Be Efficiently Created Using Neural Vocoders (2022)11.93
- Data Efficient Voice Cloning For Neural Singing Synthesis (2019)10.07
- Voice Impersonation Using Generative Adversarial Networks (2018)13.23
- Defense Against Synthetic Speech: Real-time Detection Of RVC Voice Conversion Attacks (2025)0.00
- Experimental Study: Enhancing Voice Spoofing Detection Models With Wav2vec 2.0 (2024)0.00