Autoencoder Based Architecture For Fast & Real Time Audio Style Transfer
2018 Β· Dhruv Ramani, Samarjit Karmakar, Anirban Panda, et al.
Abstract
Recently, there has been great interest in the field of audio style transfer, where a stylized audio is generated by imposing the style of a reference audio on the content of a target audio. We improve on the current approaches which use neural networks to extract the content and the style of the audio signal and propose a new autoencoder based architecture for the task. This network generates a stylized audio for a content audio in a single forward pass. The proposed network architecture proves to be advantageous over the quality of audio produced and the time taken to train the network. The network is experimented on speech signals to confirm the validity of our proposal.
Authors
(none)
Tags
Stats
Related papers
- Time Domain Neural Audio Style Transfer (2017)0.00
- Unsupervised Audiovisual Synthesis Via Exemplar Autoencoders (2020)0.00
- AUTOVC: Zero-shot Voice Style Transfer With Only Autoencoder Loss (2019)0.00
- Towards Evaluating The Robustness Of Automatic Speech Recognition Systems Via Audio Style Transfer (2024)4.52
- Timbre Transfer With Variational Auto Encoding And Cycle-consistent Adversarial Networks (2021)0.00
- Improving Performance Of Seen And Unseen Speech Style Transfer In End-to-end Neural TTS (2021)6.34
- Learning Latent Representations For Style Control And Transfer In End-to-end Speech Synthesis (2018)0.00
- A Multiscale Autoencoder (MSAE) Framework For End-to-end Neural Network Speech Enhancement (2023)6.34