A Speech Representation Anonymization Framework Via Selective Noise Perturbation
2022 Β· Minh Tran, Mohammad Soleymani
Abstract
Privacy and security are major concerns when communicating speech signals to cloud services such as automatic speech recognition (ASR) and speech emotion recognition (SER). Existing solutions for speech anonymization mainly focus on voice conversion or voice modification to convert a raw utterance into another one with similar content but different, or no, identity-related information. However, an alternative approach to share speech data under the form of privacy-preserving representation has been largely under-explored. In this paper, we propose a speech anonymization framework that achieves privacy via noise perturbation to a selected subset of the high-utility representations extracted using a pre-trained speech encoder. The subset is chosen with a Transformer-based privacy-risk saliency estimator. We validate our framework on four tasks, namely, Automatic Speaker Verification (ASV), ASR, SER and Intent Classification (IC) for privacy and utility assessment. Experimental results sh
Authors
(none)
Tags
Stats
Related papers
- Asynchronous Voice Anonymization Using Adversarial Perturbation On Speaker Embedding (2024)7.16
- Self-supervised Speech Representations Preserve Speech Characteristics While Anonymizing Voices (2022)0.00
- Speaker Anonymization Using X-vector And Neural Waveform Models (2019)0.00
- Anonymizing Speech With Generative Adversarial Networks To Preserve Speaker Privacy (2022)11.19
- Reprogramming Self-supervised Learning-based Speech Representations For Speaker Anonymization (2023)2.26
- NPU-NTU System For Voice Privacy 2024 Challenge (2024)7.16
- On-device Speaker Anonymization Of Acoustic Embeddings For ASR Based Onflexible Location Gradient Reversal Layer (2023)0.00
- Speaker Anonymization Using Neural Audio Codec Language Models (2023)10.97