The Emotional Voices Database: Towards Controlling The Emotion Dimension In Voice Generation Systems
2018 · Adaeze Adigwe, Noé Tits, Kevin El Haddad, et al.
Abstract
In this paper, we present a database of emotional speech intended to be open-sourced and used for synthesis and generation purposes. It contains data for male and female actors in English and a male actor in French. The database covers five emotion classes, so it could be suitable for building synthesis and voice transformation systems with the potential to control the emotional dimension in a continuous way. We demonstrate the data's effectiveness by building a simple MLP system that converts neutral speech to an angry speech style and evaluating it via a CMOS perception test. Even though the system is very simple, the test shows the effectiveness of the data, which is promising for future work.
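To make the evaluation step concrete, here is a minimal sketch of how a CMOS (comparative mean opinion score) test is typically aggregated: listeners rate pairs of samples on a signed scale, and the ratings are averaged with a confidence interval. The abstract does not give the rating scale or the test protocol, so the -3 to +3 scale, the function name `cmos`, and the ratings below are illustrative assumptions rather than the authors' setup.

```python
import math
import statistics


def cmos(ratings):
    """Return (mean, half_width) for a list of comparative ratings.

    half_width is the half-width of a 95% confidence interval under a
    normal approximation (an assumption; small panels may prefer a
    t-distribution).
    """
    if len(ratings) < 2:
        raise ValueError("need at least two ratings")
    mean = statistics.mean(ratings)
    # Standard error of the mean from the sample standard deviation.
    sem = statistics.stdev(ratings) / math.sqrt(len(ratings))
    return mean, 1.96 * sem


# Hypothetical listener ratings on an assumed -3..+3 scale, where
# positive values favor the first system of each pair.
ratings = [2, 1, 3, 0, 2, 1, -1, 2]
score, half_width = cmos(ratings)
print(f"CMOS = {score:.2f} +/- {half_width:.2f}")
```

A mean whose confidence interval excludes zero suggests that listeners consistently preferred one system over the other.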
Related papers
- Emotional Voice Conversion: Theory, Databases And ESD (2021) · 16.30
- A Methodology For Controlling The Emotional Expressiveness In Synthetic Speech -- A Deep Learning Approach (2019) · 5.84
- Construction And Evaluation Of Mandarin Multimodal Emotional Speech Database (2024) · 0.00
- Emospeech: A Corpus Of Emotionally Rich And Contextually Detailed Speech Annotations (2024) · 0.00
- Seen And Unseen Emotional Style Transfer For Voice Conversion With A New Emotional Speech Dataset (2020) · 16.34
- Emotional Dimension Control In Language Model-based Text-to-speech: Spanning A Broad Spectrum Of Human Emotions (2024) · 0.00
- EMOVIE: A Mandarin Emotion Speech Dataset With A Simple Emotional Text-to-speech Model (2021) · 0.00
- Emotional Voice Messages (EMOVOME) Database: Emotion Recognition In Spontaneous Voice Messages (2024) · 0.00