Informative Speech Features based on Emotion Classes and Gender in Explainable Speech Emotion Recognition

被引:1
作者
Yildirim, Huseyin Ediz [1 ]
Iren, Deniz [1 ]
机构
[1] Open Univ Netherlands, Informat Sci, Heerlen, Netherlands
来源
2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW | 2023年
关键词
speech emotion recognition; affective computing; explainable machine learning; feature selection;
D O I
10.1109/ACIIW59127.2023.10388158
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotions manifest in various aspects of human speech. While the tonality of the speech is a crucial indicator of emotions, other aspects such as word selection, pronunciation, and other paralinguistic features also provide valuable insights. Some of these aspects are considered universal, others are influenced by cultural and personal aspects, with gender being one of the most significant factors affecting emotional expressions. In this study, we aimed at investigating the effect of gender on emotional descriptors in speech. Specifically, we used intelligible paralinguistic speech features in Speech Emotion Recognition and employed Shapley values to measure the effect of gender on speech features. Furthermore, we empirically evaluated whether a reduced set of informative features could provide sufficient information for emotion recognition. Additionally, we investigated how gender influences auditory expressions of emotions. Our experiments show that besides the physical impact on fundamental speech frequencies, gender also affects how emotional phrases are spoken, and how prosody and phonology change. In addition to that, reducing the input size using the feature informativeness does not have a significant effect on the model accuracy whereas it shrinks the input size drastically by 98% on average. Finally, our comparative experiments on genders show that some speech features are more informative for capturing particular emotions exhibited by different genders. Therefore, we report that with a multi-layer feature set that consists of obscure and interpretable paralinguistic features, a novel data fusion approach could yield an explainable speech emotion recognition model. Furthermore, it is possible to reduce the input size and computational requirements by implementing feature reduction and gender information for speech emotion recognition tasks.
引用
收藏
页数:8
相关论文
共 37 条
  • [1] Glottal Flow Patterns Analyses for Parkinson's Disease Detection: Acoustic and Nonlinear Approaches
    Alexander Belalcazar-Bolanos, Elkyn
    Rafael Orozco-Arroyave, Juan
    Francisco Vargas-Bonilla, Jesus
    Haderlein, Tino
    Noeth, Elmar
    [J]. TEXT, SPEECH, AND DIALOGUE, 2016, 9924 : 400 - 407
  • [2] Normalized amplitude quotient for parametrization of the glottal flow
    Alku, P
    Bäckström, T
    Vilkman, E
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (02) : 701 - 710
  • [3] Acoustic profiles in vocal emotion expression
    Banse, R
    Scherer, KR
    [J]. JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1996, 70 (03) : 614 - 636
  • [4] Burkhardt Felix, 2005, P 9 EUR C SPEECH COM
  • [5] Busso C., 2013, Social Emotions in Nature and Artifact, P110, DOI DOI 10.1093/ACPROF:OSO/9780195387643.003.0008
  • [6] Speech Emotion Recognition Using Audio Matching
    Chaturvedi, Iti
    Noel, Tim
    Satapathy, Ranjan
    [J]. ELECTRONICS, 2022, 11 (23)
  • [8] Dahake PP, 2016, 2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), P1080, DOI 10.1109/ICACDOT.2016.7877753
  • [9] Modeling prosodic features with joint factor analysis for speaker verification
    Dehak, Najim
    Dumouchel, Pierre
    Kenny, Patrick
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (07): : 2095 - 2103
  • [10] The Effects of Aging on Acoustic Parameters of Voice
    Dehqan, Ali
    Scherer, Ronald C.
    Dashti, Gholamali
    Ansari-Moghaddam, Alireza
    Fanaie, Sepideh
    [J]. FOLIA PHONIATRICA ET LOGOPAEDICA, 2012, 64 (06) : 265 - 270