A Comparative Study on Different Labelling Schemes and Cross-Corpus Experiments in Speech Emotion Recognition

被引:0
|
作者
Baki, Pinar [1 ]
Erden, Berna [1 ]
Oncul, Serkan [1 ]
机构
[1] Arcel Arastirma Gelistirme Merkezi, Istanbul, Turkey
关键词
speech emotion recognition; cross-corpus training; emotion categories; audio classification;
D O I
10.1109/SIU53274.2021.9477924
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Performance of the speech emotion recognition systems depends on many factors such as quality of the speech data, environment, cultural differences, language, emotion categorization scheme, etc. In this work, we create a baseline speech emotion recognition model based on convolutional neural networks using the RAVDESS dataset. First, we compare the performance of the model with different labeling schemes. Then, we perform cross-corpus experiments on datasets recorded in different languages. The results show that emotion groups with common arousal or valence categories are often confused and using multiple corpora in training improves the generalization capacity of the model.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] CROSS-CORPUS SPEECH EMOTION RECOGNITION USING JOINT DISTRIBUTION ADAPTIVE REGRESSION
    Zhang, Jiacheng
    Jiang, Lin
    Zong, Yuan
    Zheng, Wenming
    Zhao, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3790 - 3794
  • [32] Auditory attention model based on Chirplet for cross-corpus speech emotion recognition
    Zhang X.
    Song P.
    Zha C.
    Tao H.
    Zhao L.
    Zhao, Li (zhaoli@seu.edu.cn), 1600, Southeast University (32): : 402 - 407
  • [33] A Novel DBN Feature Fusion Model for Cross-Corpus Speech Emotion Recognition
    Zou Cairong
    Zhang Xinran
    Zha Cheng
    Zhao Li
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
  • [34] Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition
    Ye, Jiaxin
    Wei, Yujie
    Wen, Xin-Cheng
    Ma, Chenglong
    Huang, Zhizhong
    Liu, Kunhong
    Shan, Hongming
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5956 - 5965
  • [35] Exploring corpus-invariant emotional acoustic feature for cross-corpus speech emotion recognition
    Lian, Hailun
    Lu, Cheng
    Zhao, Yan
    Li, Sunan
    Qi, Tianhua
    Zong, Yuan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [36] Cross-Corpus Speech Emotion Recognition Based on Joint Transfer Subspace Learning and Regression
    Zhang, Weijian
    Song, Peng
    Chen, Dongliang
    Sheng, Chao
    Zhang, Wenjing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 588 - 598
  • [37] Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
    Retta, Ephrem Afele
    Sutcliffe, Richard
    Mahmood, Jabar
    Berwo, Michael Abebe
    Almekhlafi, Eiad
    Khan, Sajjad Ahmad
    Chaudhry, Shehzad Ashraf
    Mhamed, Mustafa
    Feng, Jun
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [38] Improved Cross-Corpus Speech Emotion Recognition Using Deep Local Domain Adaptation
    ZHAO Huijuan
    YE Ning
    WANG Ruchuan
    ChineseJournalofElectronics, 2023, 32 (03) : 640 - 646
  • [39] Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies
    Schuller, Bjoern
    Vlasenko, Bogdan
    Eyben, Florian
    Woellmer, Martin
    Stuhlsatz, Andre
    Wendemuth, Andreas
    Rigoll, Gerhard
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2010, 1 (02) : 119 - 131
  • [40] Improved Cross-Corpus Speech Emotion Recognition Using Deep Local Domain Adaptation
    Zhao Huijuan
    Ye Ning
    Wang Ruchuan
    CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (03) : 640 - 646