Construction And Evaluation Of Mandarin Multimodal Emotional Speech Database
2024 Β· Zhu Ting, Li Liangqi, Duan Shufei, et al.
Abstract
A multi-modal emotional speech Mandarin database including articulatory kinematics, acoustics, glottal and facial micro-expressions is designed and established, which is described in detail from the aspects of corpus design, subject selection, recording details and data processing. Where signals are labeled with discrete emotion labels (neutral, happy, pleasant, indifferent, angry, sad, grief) and dimensional emotion labels (pleasure, arousal, dominance). In this paper, the validity of dimension annotation is verified by statistical analysis of dimension annotation data. The SCL-90 scale data of annotators are verified and combined with PAD annotation data for analysis, so as to explore the internal relationship between the outlier phenomenon in annotation and the psychological state of annotators. In order to verify the speech quality and emotion discrimination of the database, this paper uses 3 basic models of SVM, CNN and DNN to calculate the recognition rate of these seven emotions
Authors
(none)
Tags
Stats
Related papers
- EMOVIE: A Mandarin Emotion Speech Dataset With A Simple Emotional Text-to-speech Model (2021)0.00
- The Emotional Voices Database: Towards Controlling The Emotion Dimension In Voice Generation Systems (2018)0.00
- Emospeech: A Corpus Of Emotionally Rich And Contextually Detailed Speech Annotations (2024)0.00
- Multimodal Speech Emotion Recognition And Ambiguity Resolution (2019)0.00
- Emotional Voice Conversion: Theory, Databases And ESD (2021)16.30
- Shemo -- A Large-scale Validated Database For Persian Speech Emotion Detection (2019)13.70
- ML-SAN: Multi-level Speaker-adaptive Network For Emotion Recognition In Conversations (2026)0.00
- Speecheq: Speech Emotion Recognition Based On Multi-scale Unified Datasets And Multitask Learning (2022)5.84