Mexican Emotional Speech Database Based on Semantic, Frequency, Familiarity, Concreteness, and Cultural Shaping of Affective Prosody

被引:9
作者
Duville, Mathilde Marie [1 ]
Alonso-Valerdi, Luz Maria [1 ]
Ibarra-Zarate, David I. [1 ]
机构
[1] Tecnol Monterrey, Escuela Ingn Ciencias, Ave Eugenio Garza Sada 2501, Monterrey 64849, Mexico
关键词
affective computing; audio database; cross-cultural; machine learning; Mexican Spanish; emotional speech; paralinguistic information; discrete emotions; AFFECTIVE NORMS; SPANISH WORDS; RECOGNITION; FEATURES; EXPRESSION; CLASSIFICATION; VOICE; PERCEPTION; CATEGORIES; DIALECTS;
D O I
10.3390/data6120130
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the Mexican Emotional Speech Database (MESD) that contains single-word emotional utterances for anger, disgust, fear, happiness, neutral and sadness with adult (male and female) and child voices is described. To validate the emotional prosody of the uttered words, a cubic Support Vector Machines classifier was trained on the basis of prosodic, spectral and voice quality features for each case study: (1) male adult, (2) female adult and (3) child. In addition, cultural, semantic, and linguistic shaping of emotional expression was assessed by statistical analysis. This study was registered at BioMed Central and is part of the implementation of a published study protocol. Mean emotional classification accuracies yielded 93.3%, 89.4% and 83.3% for male, female and child utterances respectively. Statistical analysis emphasized the shaping of emotional prosodies by semantic and linguistic features. A cultural variation in emotional expression was highlighted by comparing the MESD with the INTERFACE for Castilian Spanish database. The MESD provides reliable content for linguistic emotional prosody shaped by the Mexican cultural environment. In order to facilitate further investigations, a corpus controlled for linguistic features and emotional semantics, as well as one containing words repeated across voices and emotions are provided. The MESD is made freely available.
引用
收藏
页数:34
相关论文
共 73 条
  • [1] Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers
    Akcay, Mehmet Berkehan
    Oguz, Kaya
    [J]. SPEECH COMMUNICATION, 2020, 116 (116) : 56 - 76
  • [2] Automatic Voice Pathology Monitoring Using Parallel Deep Models for Smart Healthcare
    Alhussein, Musaed
    Muhammad, Ghulam
    [J]. IEEE ACCESS, 2019, 7 : 46474 - 46479
  • [3] Alonso-Valerdi L.M., 2021, **DATA OBJECT**, VV2, P1644, DOI [10.17632/cy34mh68j9.2, DOI 10.17632/CY34MH68J9.2]
  • [4] Emotional Intelligence and the Practice of Organized Physical-Sport Activity in Children
    Amado-Alonso, Diana
    Leon-del-Barco, Benito
    Mendo-Lazaro, Santiago
    Sanchez-Miguel, Pedro A.
    Iglesias Gallego, Damian
    [J]. SUSTAINABILITY, 2019, 11 (06):
  • [5] Improved speech emotion recognition with Mel frequency magnitude coefficient
    Ancilin, J.
    Milton, A.
    [J]. APPLIED ACOUSTICS, 2021, 179
  • [6] Beyond Correlation: Acoustic Transformation Methods for the Experimental Study of Emotional Voice and Speech
    Arias, Pablo
    Rachman, Laura
    Liuni, Marco
    Aucouturier, Jean-Julien
    [J]. EMOTION REVIEW, 2021, 13 (01) : 12 - 24
  • [7] Feature Selection for Speech Emotion Recognition in Spanish and Basque: On the Use of Machine Learning to Improve Human-Computer Interaction
    Arruti, Andoni
    Cearreta, Idoia
    Alvarez, Aitor
    Lazkano, Elena
    Sierra, Basilio
    [J]. PLOS ONE, 2014, 9 (10):
  • [8] Two-stage dimensional emotion recognition by fusing predictions of acoustic and text networks using SVM
    Atmaja, Bagus Tris
    Akagi, Masato
    [J]. SPEECH COMMUNICATION, 2021, 126 : 9 - 21
  • [9] Effects of emotional valence and arousal on the voice perception network
    Bestelmeyer, Patricia E. G.
    Kotz, Sonja A.
    Belin, Pascal
    [J]. SOCIAL COGNITIVE AND AFFECTIVE NEUROSCIENCE, 2017, 12 (08) : 1351 - 1358
  • [10] Bagged support vector machines for emotion recognition from speech
    Bhavan, Anjali
    Chauhan, Pankaj
    Hitkul
    Shah, Rajiv Ratn
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 184