Recognition of Cross-Language Acoustic Emotional Valence Using Stacked Ensemble Learning

被引:7
|
作者
Zvarevashe, Kudakwashe [1 ]
Olugbara, Oludayo O. [1 ]
机构
[1] Durban Univ Technol, South Africa Luban Workshop, ICT & Soc Res Grp, ZA-4001 Durban, South Africa
关键词
deep learning; ensemble learning; feature elimination; feature selection; speech emotion; speech recognition; SPEECH; FEATURES;
D O I
10.3390/a13100246
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most of the studies on speech emotion recognition have used single-language corpora, but little research has been done in cross-language valence speech emotion recognition. Research has shown that the models developed for single-language speech recognition systems perform poorly when used in different environments. Cross-language speech recognition is a craving alternative, but it is highly challenging because the corpora used will have been recorded in different environments and under varying conditions. The differences in the quality of recording devices, elicitation techniques, languages, and accents of speakers make the recognition task even more arduous. In this paper, we propose a stacked ensemble learning algorithm to recognize valence emotion in a cross-language speech environment. The proposed ensemble algorithm was developed from random decision forest, AdaBoost, logistic regression, and gradient boosting machine and is therefore called RALOG. In addition, we propose feature scaling using random forest recursive feature elimination and a feature selection algorithm to boost the performance of RALOG. The algorithm has been evaluated against four widely used ensemble algorithms to appraise its performance. The amalgam of five benchmarked corpora has resulted in a cross-language corpus to validate the performance of RALOG trained with the selected acoustic features. The comparative analysis results have shown that RALOG gave better performance than the other ensemble learning algorithms investigated in this study.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Cross-language Transfer Speech Recognition using Deep Learning
    Zhao, Yue
    Xu, Yan M.
    Sun, Mei J.
    Xu, Xiao N.
    Wang, Hui
    Yang, Guo S.
    Ji, Qiang
    11TH IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION (ICCA), 2014, : 1422 - 1426
  • [2] Cross-Language Acoustic Emotion Recognition: An Overview and Some Tendencies
    Feraru, Silvia Monica
    Schuller, Dagmar
    Schuller, Bjoern
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 125 - 131
  • [3] Cross-language use of acoustic information for automatic speech recognition
    Nieuwoudt, C
    Botha, EC
    SPEECH COMMUNICATION, 2002, 38 (1-2) : 101 - 113
  • [4] Cross-language adaptation of acoustic models in automatic speech recognition
    Univ of Pretoria, Pretoria, South Africa
    IEEE AFRICON Conf, (181-184):
  • [5] Cross-language Speech Attribute Detection and Phone Recognition for Tibetan Using Deep Learning
    Wang, Hui
    Zhao, Yue
    Xu, Yanmin
    Xu, Xiaona
    Suo, Xingmei
    Ji, Qiang
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 474 - +
  • [6] Cross-Language Speech Emotion Recognition Via Multiple Kernel Learning
    Zha, Cheng
    2019 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2019, : 208 - 209
  • [7] CLEM: A cross-language emotional metanorm in children
    Belmon, Johanne
    Noyer-Martin, Magali
    Jhean-Larose, Sandra
    FIRST LANGUAGE, 2024, 44 (05) : 493 - 506
  • [8] Cross-language acoustic model refinement forthe Indonesian language
    Martin, T
    Sridharan, S
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 865 - 868
  • [9] Cross-Language Learning for Product Matching
    Peeters, Ralph
    Bizer, Christian
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 236 - 238
  • [10] Cross-Language Text Classification using Structural Correspondence Learning
    Prettenhofer, Peter
    Stein, Benno
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010, : 1118 - 1127