Quality-net: An End-to-end Non-intrusive Speech Quality Assessment Model Based On BLSTM
2018 Β· Szu-Wei Fu, Yu Tsao, Hsin-Te Hwang, et al.
Abstract
Nowadays, most of the objective speech quality assessment tools (e.g., perceptual evaluation of speech quality (PESQ)) are based on the comparison of the degraded/processed speech with its clean counterpart. The need of a "golden" reference considerably restricts the practicality of such assessment tools in real-world scenarios since the clean reference usually cannot be accessed. On the other hand, human beings can readily evaluate the speech quality without any reference (e.g., mean opinion score (MOS) tests), implying the existence of an objective and non-intrusive (no clean reference needed) quality assessment mechanism. In this study, we propose a novel end-to-end, non-intrusive speech quality evaluation model, termed Quality-Net, based on bidirectional long short-term memory. The evaluation of utterance-level quality in Quality-Net is based on the frame-level assessment. Frame constraints and sensible initializations of forget gate biases are applied to learn meaningful frame-lev
Authors
(none)
Tags
Stats
Related papers
- Metricnet: Towards Improved Modeling For Non-intrusive Speech Quality Assessment (2021)0.00
- Non-intrusive Speech Quality Assessment Using Neural Networks (2019)13.74
- Multi-task Pseudo-label Learning For Non-intrusive Speech Quality Assessment Model (2023)0.00
- Stoi-net: A Deep Learning Based Non-intrusive Speech Intelligibility Assessment Model (2020)0.00
- More For Less: Non-intrusive Speech Quality Assessment With Limited Annotations (2021)7.16
- Inqss: A Speech Intelligibility And Quality Assessment Model Using A Multi-task Learning Network (2021)9.76
- Attentivemos: A Lightweight Attention-only Model For Speech Quality Prediction (2024)3.58
- Attention-based Speech Enhancement Using Human Quality Perception Modelling (2023)0.00