Machine Learning: Challenges, Limitations, And Compatibility For Audio Restoration Processes
2021 Β· Owen Casey, Rushit Dave, Naeem Seliya, et al.
Abstract
In this paper machine learning networks are explored for their use in restoring degraded and compressed speech audio. The project intent is to build a new trained model from voice data to learn features of compression artifacting distortion introduced by data loss from lossy compression and resolution loss with an existing algorithm presented in SEGAN: Speech Enhancement Generative Adversarial Network. The resulting generator from the model was then to be used to restore degraded speech audio. This paper details an examination of the subsequent compatibility and operational issues presented by working with deprecated code, which obstructed the trained model from successfully being developed. This paper further serves as an examination of the challenges, limitations, and compatibility in the current state of machine learning.
Authors
(none)
Tags
Stats
Related papers
- Effect Of Noise Suppression Losses On Speech Distortion And ASR Performance (2021)10.74
- Active Restoration Of Lost Audio Signals Using Machine Learning And Latent Information (2021)0.00
- A Consolidated View Of Loss Functions For Supervised Deep Learning-based Speech Enhancement (2020)13.93
- Adversarial Machine Learning And Speech Emotion Recognition: Utilizing Generative Adversarial Networks For Robustness (2018)0.00
- SEGAN: Speech Enhancement Generative Adversarial Network (2017)21.85
- Restorative Speech Enhancement: A Progressive Approach Using SE And Codec Modules (2024)0.00
- Cheapnet: Improving Light-weight Speech Enhancement Network By Projected Loss Function (2023)0.00
- Boosting Noise Robustness Of Acoustic Model Via Deep Adversarial Training (2018)9.23