Non-intrusive Speech Quality Assessment Using Neural Networks
2019 Β· Anderson R. Avila, Hannes Gamper, Chandan Reddy, et al.
Abstract
Estimating the perceived quality of an audio signal is critical for many multimedia and audio processing systems. Providers strive to offer optimal and reliable services in order to increase the user quality of experience (QoE). In this work, we present an investigation of the applicability of neural networks for non-intrusive audio quality assessment. We propose three neural network-based approaches for mean opinion score (MOS) estimation. We compare our results to three instrumental measures: the perceptual evaluation of speech quality (PESQ), the ITU-T Recommendation P.563, and the speech-to-reverberation energy ratio. Our evaluation uses a speech dataset contaminated with convolutive and additive noise, labeled using a crowd-based QoE evaluation, evaluated with Pearson correlation with MOS labels, and mean-squared-error of the estimated MOS. Our proposed approaches outperform the aforementioned instrumental measures, with a fully connected deep neural network using Mel-frequency fe
Authors
(none)
Tags
Stats
Related papers
- Metricnet: Towards Improved Modeling For Non-intrusive Speech Quality Assessment (2021)0.00
- Attention-based Speech Enhancement Using Human Quality Perception Modelling (2023)0.00
- Comparison Of Speech Representations For Automatic Quality Estimation In Multi-speaker Text-to-speech Synthesis (2020)0.00
- MMMOS: Multi-domain Multi-axis Audio Quality Assessment (2025)0.00
- Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model (2023)0.00
- More For Less: Non-intrusive Speech Quality Assessment With Limited Annotations (2021)7.16
- Quality-net: An End-to-end Non-intrusive Speech Quality Assessment Model Based On BLSTM (2018)15.62
- Pre-trained Speech Representations As Feature Extractors For Speech Quality Assessment In Online Conferencing Applications (2022)5.84