Speaker Anonymization Using X-vector And Neural Waveform Models
2019 Β· Fuming Fang, Xin Wang, Junichi Yamagishi, et al.
Abstract
The social media revolution has produced a plethora of web services to which users can easily upload and share multimedia documents. Despite the popularity and convenience of such services, the sharing of such inherently personal data, including speech data, raises obvious security and privacy concerns. In particular, a user's speech data may be acquired and used with speech synthesis systems to produce high-quality speech utterances which reflect the same user's speaker identity. These utterances may then be used to attack speaker verification systems. One solution to mitigate these concerns involves the concealing of speaker identities before the sharing of speech data. For this purpose, we present a new approach to speaker anonymization. The idea is to extract linguistic and speaker identity features from an utterance and then to use these with neural acoustic and waveform models to synthesize anonymized speech. The original speaker identity, in the form of timbre, is suppressed and
Authors
(none)
Tags
Stats
Related papers
- Speaker Anonymization Using Neural Audio Codec Language Models (2023)10.97
- Speaker Anonymization With Distribution-preserving X-vector Generation For The Voiceprivacy Challenge 2020 (2020)0.00
- Distinctive And Natural Speaker Anonymization Via Singular Value Transformation-assisted Matrix (2024)8.82
- Exploring The Importance Of F0 Trajectories For Speaker Anonymization Using X-vectors And Neural Waveform Models (2021)0.00
- Voiceprivacy 2022 System Description: Speaker Anonymization With Feature-matched F0 Trajectories (2022)0.00
- Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding (2024)7.16
- A Speech Representation Anonymization Framework Via Selective Noise Perturbation (2022)6.34
- Language-independent Speaker Anonymization Approach Using Self-supervised Pre-trained Models (2022)9.92