EMNS /imz/ Corpus: An Emotive Single-speaker Dataset For Narrative Storytelling In Games, Television And Graphic Novels
2023 · Kari Ali Noriy, Xiaosong Yang, Jian Jun Zhang
Abstract
The increasing adoption of text-to-speech technologies has led to a growing demand for natural and emotive voices that adapt to a conversation's context and emotional tone. The Emotive Narrative Storytelling (EMNS) corpus is a unique speech dataset created to enhance conversations' expressiveness and emotive quality in interactive narrative-driven systems. The corpus consists of a 2.3-hour recording featuring a female speaker delivering labelled utterances. It encompasses eight acted emotional states, evenly distributed with a variance of 0.68%, along with expressiveness levels and natural language descriptions with word emphasis labels. The evaluation of audio samples from different datasets revealed that the EMNS corpus achieved the highest average scores in accurately conveying emotions and demonstrating expressiveness. It outperformed other datasets in conveying shared emotions and achieved comparable levels of genuineness. A classification task confirmed the accurate representatio
Authors
(none)
Tags
Stats
Related papers
- Emospeech: A Corpus Of Emotionally Rich And Contextually Detailed Speech Annotations (2024)0.00
- EMOVIE: A Mandarin Emotion Speech Dataset With A Simple Emotional Text-to-speech Model (2021)0.00
- EMOVOME: A Dataset For Emotion Recognition In Spontaneous Real-life Speech (2024)0.00
- Emotional Voice Messages (EMOVOME) Database: Emotion Recognition In Spontaneous Voice Messages (2024)0.00
- The Emotional Voices Database: Towards Controlling The Emotion Dimension In Voice Generation Systems (2018)0.00
- A Methodology For Controlling The Emotional Expressiveness In Synthetic Speech -- A Deep Learning Approach (2019)5.84
- Storytts: A Highly Expressive Text-to-speech Dataset With Rich Textual Expressiveness Annotations (2024)3.58
- Detecting Emotion Carriers By Combining Acoustic And Lexical Representations (2021)3.58