Shemo -- A Large-scale Validated Database For Persian Speech Emotion Detection
2019 · Omid Mohamad Nezami, Paria Jamshid Lou, Mansoureh Karami
Abstract
This paper introduces a large-scale, validated database for Persian called Sharif Emotional Speech Database (ShEMO). The database includes 3000 semi-natural utterances, equivalent to 3 hours and 25 minutes of speech data extracted from online radio plays. The ShEMO covers speech samples of 87 native-Persian speakers for five basic emotions including anger, fear, happiness, sadness and surprise, as well as neutral state. Twelve annotators label the underlying emotional state of utterances and majority voting is used to decide on the final labels. According to the kappa measure, the inter-annotator agreement is 64% which is interpreted as "substantial agreement". We also present benchmark results based on common classification methods in speech emotion detection task. According to the experiments, support vector machine achieves the best results for both gender-independent (58.2%) and gender-dependent models (female=59.4%, male=57.6%). The ShEMO is available for academic purposes free of
Authors
(none)
Tags
Stats
Related papers
- EMOVOME: A Dataset For Emotion Recognition In Spontaneous Real-life Speech (2024)0.00
- Emotional Voice Messages (EMOVOME) Database: Emotion Recognition In Spontaneous Voice Messages (2024)0.00
- Emospeech: A Corpus Of Emotionally Rich And Contextually Detailed Speech Annotations (2024)0.00
- Construction And Evaluation Of Mandarin Multimodal Emotional Speech Database (2024)0.00
- The Emotional Voices Database: Towards Controlling The Emotion Dimension In Voice Generation Systems (2018)0.00
- Emobox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit And Benchmark (2024)11.49
- EMOVIE: A Mandarin Emotion Speech Dataset With A Simple Emotional Text-to-speech Model (2021)0.00
- CAMEO: Collection Of Multilingual Emotional Speech Corpora (2025)0.00