High-fidelity Audio Generation And Representation Learning With Guided Adversarial Autoencoder
2020 · Kazi Nazmul Haque, Rajib Rana, Björn W Schuller
Abstract
Unsupervised disentangled representation learning from the unlabelled audio data, and high fidelity audio generation have become two linchpins in the machine learning research fields. However, the representation learned from an unsupervised setting does not guarantee its' usability for any downstream task at hand, which can be a wastage of the resources, if the training was conducted for that particular posterior job. Also, during the representation learning, if the model is highly biased towards the downstream task, it losses its generalisation capability which directly benefits the downstream job but the ability to scale it to other related task is lost. Therefore, to fill this gap, we propose a new autoencoder based model named "Guided Adversarial Autoencoder (GAAE)", which can learn both post-task-specific representations and the general representation capturing the factors of variation in the training data leveraging a small percentage of labelled samples; thus, makes it suitable
Authors
(none)
Tags
Stats
Related papers
- Enhancing Unsupervised Audio Representation Learning Via Adversarial Sample Generation (2023)0.00
- Audioldm 2: Learning Holistic Audio Generation With Self-supervised Pretraining (2023)0.00
- Audio Language Modeling Using Perceptually-guided Discrete Representations (2022)0.00
- Audiogen: Textually Guided Audio Generation (2022)0.00
- Bandwidth Extension On Raw Audio Via Generative Adversarial Networks (2019)0.00
- RAVE: A Variational Autoencoder For Fast And High-quality Neural Audio Synthesis (2021)0.00
- Guided Variational Autoencoder For Speech Enhancement With A Supervised Classifier (2021)8.60
- Learning And Controlling The Source-filter Representation Of Speech With A Variational Autoencoder (2022)7.50