Generative Adversarial Network Based Voice Conversion: Techniques, Challenges, And Recent Advancements
2025 Β· Sandipan Dhar, Nanda Dulal Jana, Swagatam Das
Abstract
Voice conversion (VC) stands as a crucial research area in speech synthesis, enabling the transformation of a speaker's vocal characteristics to resemble another while preserving the linguistic content. This technology has broad applications, including automated movie dubbing, speech-to-singing conversion, and assistive devices for pathological speech rehabilitation. With the increasing demand for high-quality and natural-sounding synthetic voices, researchers have developed a wide range of VC techniques. Among these, generative adversarial network (GAN)-based approaches have drawn considerable attention for their powerful feature-mapping capabilities and potential to produce highly realistic speech. Despite notable advancements, challenges such as ensuring training stability, maintaining linguistic consistency, and achieving perceptual naturalness continue to hinder progress in GAN-based VC systems. This systematic review presents a comprehensive analysis of the voice conversion lands
Authors
(none)
Tags
Stats
Related papers
- An Adaptive Learning Based Generative Adversarial Network For One-to-one Voice Conversion (2021)10.61
- Investigating Deep Neural Structures And Their Interpretability In The Domain Of Voice Conversion (2021)0.00
- Subband-based Generative Adversarial Network For Non-parallel Many-to-many Voice Conversion (2022)0.00
- Starganv2-vc: A Diverse, Unsupervised, Non-parallel Framework For Natural-sounding Voice Conversion (2021)13.70
- Beyond Voice Identity Conversion: Manipulating Voice Attributes By Adversarial Learning Of Structured Disentangled Representations (2021)0.00
- An Overview Of Voice Conversion And Its Challenges: From Statistical Modeling To Deep Learning (2020)18.53
- CVC: Contrastive Learning For Non-parallel Voice Conversion (2020)7.50
- Stargan-vc: Non-parallel Many-to-many Voice Conversion With Star Generative Adversarial Networks (2018)18.09