Joint Far- And Near-end Speech Intelligibility Enhancement Based On The Approximated Speech Intelligibility Index
2021 · Andreas Jonas Fuglsig, Jan Østergaard, Jesper Jensen, et al.
Abstract
This paper considers speech enhancement of signals that are picked up in one noisy environment and must be presented to a listener in another noisy environment. Recently, it has been shown that an optimal solution to this problem requires the noise sources in both environments to be considered jointly. However, the existing optimal mutual-information-based method requires a complicated system model that includes natural speech variations, and it relies on approximations and assumptions about the underlying signal distributions. In this paper, we propose to use a simpler signal model and to optimize speech intelligibility based on the Approximated Speech Intelligibility Index (ASII). We derive a closed-form solution to the joint far- and near-end speech enhancement problem that is independent of the marginal distribution of the signal coefficients and that achieves performance similar to existing work. In addition, we do not need to model or optimize for natural speech variations.
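To make the objective concrete, the following is a minimal sketch of how an ASII-style score could be evaluated for a joint far- and near-end setup. It assumes the commonly used form of the ASII, a band-importance-weighted sum of SNR / (SNR + 1) over frequency bands, and a hypothetical signal model in which a per-band gain is applied to speech plus far-end noise before near-end noise is added at the listener. The function names, band weights, and power values are illustrative assumptions. This is not the paper's closed-form solution, and no power constraint is modelled.

```python
def asii(snrs, weights):
    """ASII-style score: band-importance-weighted sum of snr / (snr + 1)."""
    return sum(w * snr / (snr + 1.0) for snr, w in zip(snrs, weights))


def listener_snrs(gains, speech_pow, far_noise_pow, near_noise_pow):
    """Per-band SNR at the listener under the assumed model.

    Assumption: the gain g scales both the speech and the far-end noise,
    and the near-end noise is added afterwards, unaffected by the gain.
    """
    return [
        (g * g * s) / (g * g * v + w)
        for g, s, v, w in zip(gains, speech_pow, far_noise_pow, near_noise_pow)
    ]


# Hypothetical 3-band example (all values are made up for illustration).
weights = [0.2, 0.5, 0.3]      # band-importance weights, summing to 1
speech = [1.0, 1.0, 1.0]       # clean speech power per band
far_noise = [0.1, 0.1, 0.1]    # far-end noise power per band
near_noise = [1.0, 1.0, 1.0]   # near-end noise power per band

unit_score = asii(
    listener_snrs([1.0, 1.0, 1.0], speech, far_noise, near_noise), weights
)
boosted_score = asii(
    listener_snrs([2.0, 2.0, 2.0], speech, far_noise, near_noise), weights
)
```

In this sketch, raising the gain improves the score only up to the ceiling set by the far-end noise, since the amplified far-end noise stays in the denominator. That is one way to see why both noise sources have to be considered jointly.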
Related papers
- Multi-metric Optimization Using Generative Adversarial Networks For Near-end Speech Intelligibility Enhancement (2021)
- Monaural Speech Enhancement Using Deep Neural Networks By Maximizing A Short-time Objective Intelligibility Measure (2018)
- Speech Enhancement In Adverse Environments Based On Non-stationary Noise-driven Spectral Subtraction And SNR-dependent Phase Compensation (2018)
- Improved Far-field Speech Recognition Using Joint Variational Autoencoder (2022)
- Model-based Speech Enhancement In The Modulation Domain (2017)
- Interactive Feature Fusion For End-to-end Noise-robust Speech Recognition (2021)
- On The Relationship Between Short-time Objective Intelligibility And Short-time Spectral-amplitude Mean-square Error For Speech Enhancement (2018)
- iMetricGAN: Intelligibility Enhancement For Speech-in-noise Using Generative Adversarial Network-based Metric Learning (2020)