Exploring The Potential Of Data-driven Spatial Audio Enhancement Using A Single-channel Model
2024 · Arthur N. Dos Santos, Bruno S. Masiero, Túlio C. L. Mateus
Abstract
One key aspect differentiating data-driven single- and multi-channel speech enhancement and dereverberation methods is that both the problem formulation and complexity of the solutions are considerably more challenging in the latter case. Additionally, with limited computational resources, it is cumbersome to train models that require the management of larger datasets or those with more complex designs. In this scenario, an unverified hypothesis that single-channel methods can be adapted to multi-channel scenarios simply by processing each channel independently holds significant implications, boosting compatibility between sound scene capture and system input-output formats, while also allowing modern research to focus on other challenging aspects, such as full-bandwidth audio enhancement, competitive noise suppression, and unsupervised learning. This study verifies this hypothesis by comparing the enhancement promoted by a basic single-channel speech enhancement and dereverberation mo
Authors
(none)
Tags
Stats
Related papers
- Exploiting Single-channel Speech For Multi-channel End-to-end Speech Recognition (2021)0.00
- Insights Into Deep Non-linear Filters For Improved Multi-channel Speech Enhancement (2022)13.93
- Efficient Multi-channel Speech Enhancement With Spherical Harmonics Injection For Directional Encoding (2023)3.58
- Multi-channel Speaker Verification For Single And Multi-talker Speech (2020)0.00
- Multi-channel Target Speech Extraction With Channel Decorrelation And Target Speaker Adaptation (2020)0.00
- Decoupled Spatial And Temporal Processing For Resource Efficient Multichannel Speech Enhancement (2024)0.00
- End-to-end Multi-channel Speaker Extraction And Binaural Speech Synthesis (2024)0.00
- Hierarchical Modeling Of Spatial Cues Via Spherical Harmonics For Multi-channel Speech Enhancement (2023)0.00