Cross-Corpus Speech Emotion Recognition Based on Causal Emotion Information Representation

Cited by: 0
Authors
Fu, Hongliang [1]
Li, Qianqian [1]
Tao, Huawei [1]
Zhu, Chunhua [1]
Xie, Yue [2]
Guo, Ruxue [3]
Affiliations
[1] Henan Univ Technol, Key Lab Grain Informat Proc & Control, Minist Educ, Zhengzhou 450001, Peoples R China
[2] Nanjing Inst Technol, Sch Commun Engn, Nanjing 211167, Peoples R China
[3] IFLYTEK Res, Hefei 230088, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
cross-corpus speech emotion recognition; causal representation learning; domain adaptation;
DOI
10.1587/transinf.2023EDL8087
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Speech emotion recognition (SER) is a key technology for realizing third-generation artificial intelligence and is widely used in human-computer interaction, emotion diagnosis, interpersonal communication, and other fields. However, the aliasing of language and semantic information in speech tends to distort the alignment of emotion features, which degrades the performance of cross-corpus SER systems. This paper proposes a cross-corpus SER model based on causal emotion information representation (CEIR). The model uses the reconstruction loss of a deep autoencoder network together with source-domain label information to achieve a preliminary separation of causal features. A causal correlation matrix is then constructed and combined with local maximum mean discrepancy (LMMD) feature alignment so that the causal features of different dimensions become jointly independent in distribution. Finally, supervised fine-tuning on labeled data is used to extract the causal emotion information effectively. Experimental results show that the average unweighted average recall (UAR) of the proposed algorithm is 3.4% to 7.01% higher than that of several recent algorithms in the field.
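The abstract reports results as unweighted average recall (UAR), which is the mean of the per-class recalls, so that rare emotion classes count as much as frequent ones. The following is a minimal sketch of that metric only, not of the CEIR model; the function name and the toy labels are assumptions made for illustration and do not come from the paper.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls (each class weighted equally)."""
    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical toy labels: "angry" is over-represented on purpose.
y_true = ["angry", "angry", "angry", "angry", "happy", "sad"]
y_pred = ["angry", "angry", "angry", "sad", "happy", "happy"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.4f}, accuracy = {accuracy:.4f}")
```

On imbalanced data such as this toy example, UAR and plain accuracy generally differ, which is why cross-corpus SER work often reports UAR.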
Pages: 1097 - 1100
Number of pages: 4