Transferable discriminant linear regression for cross-corpus speech emotion recognition

被引：7

作者：

Li, Shaokai ^{[1
]}

Song, Peng ^{[1
]}

Zhang, Wenjing ^{[1
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China

来源：

APPLIED ACOUSTICS | 2022年 / 197卷

基金：

中国国家自然科学基金;

关键词：

Linear regression; Speech emotion recognition; Category space; Transfer learning; LEAST-SQUARES REGRESSION; GENERAL FRAMEWORK; FEATURES; REGULARIZATION; CLASSIFICATION; ADAPTATION; DATABASES;

D O I：

10.1016/j.apacoust.2022.108919

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Speech emotion recognition (SER) has attracted much interest recently due to its wide applications. However, it should be noted that most SER methods are conducted on the assumption that the training and testing data are from the same database. In real applications, this assumption does not hold, and the recognition performance will be significantly degraded. To solve this problem, we present a novel trans-ferable discriminant linear regression (TDLR) approach for cross-corpus SER. Specifically, first, we intro-duce a non-negative label relaxation linear regression on source corpus to help learn transferable feature representations. Second, we propose a simple but effective strategy to keep the linear relationship between the labels of source and target corpora. Meanwhile, we utilize the discriminative maximum mean discrepancy (MMD) as the distance metric between two databases. Furthermore, we use the graph Laplacian to preserve the geometric structure of samples, which can further reduce the distribution gap between the two databases. Additionally, to better obtain the intrinsic properties of data and make the model robust, we impose an '2;1-norm on the transformation matrices. Extensive experiments have been carried out on several standard databases, and the results show that TDLR can obtain better recognition performance than several state-of-the-art algorithms. (C) 2022 Elsevier Ltd. All rights reserved.

引用

页数：11

共 50 条

[31] Progressive distribution adapted neural networks for cross-corpus speech emotion recognition
Zong, Yuan
Lian, Hailun
Zhang, Jiacheng
Feng, Ercui
Lu, Cheng
Chang, Hongli
Tang, Chuangao
FRONTIERS IN NEUROROBOTICS, 2022, 16
[32] Auditory attention model based on Chirplet for cross-corpus speech emotion recognition
Zhang X.
Song P.
Zha C.
Tao H.
Zhao L.
Zhao, Li (zhaoli@seu.edu.cn), 1600, Southeast University (32): : 402 - 407
[33] A Novel DBN Feature Fusion Model for Cross-Corpus Speech Emotion Recognition
Zou Cairong
Zhang Xinran
Zha Cheng
Zhao Li
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2016, 2016
[34] Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition
Ye, Jiaxin
Wei, Yujie
Wen, Xin-Cheng
Ma, Chenglong
Huang, Zhizhong
Liu, Kunhong
Shan, Hongming
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5956 - 5965
[35] Exploring corpus-invariant emotional acoustic feature for cross-corpus speech emotion recognition
Lian, Hailun
Lu, Cheng
Zhao, Yan
Li, Sunan
Qi, Tianhua
Zong, Yuan
EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
[36] A Comparative Study on Different Labelling Schemes and Cross-Corpus Experiments in Speech Emotion Recognition
Baki, Pinar
Erden, Berna
Oncul, Serkan
29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
[37] Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
Retta, Ephrem Afele
Sutcliffe, Richard
Mahmood, Jabar
Berwo, Michael Abebe
Almekhlafi, Eiad
Khan, Sajjad Ahmad
Chaudhry, Shehzad Ashraf
Mhamed, Mustafa
Feng, Jun
APPLIED SCIENCES-BASEL, 2023, 13 (23):
[38] Improved Cross-Corpus Speech Emotion Recognition Using Deep Local Domain Adaptation
ZHAO Huijuan
YE Ning
WANG Ruchuan
ChineseJournalofElectronics, 2023, 32 (03) : 640 - 646
[39] Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies
Schuller, Bjoern
Vlasenko, Bogdan
Eyben, Florian
Woellmer, Martin
Stuhlsatz, Andre
Wendemuth, Andreas
Rigoll, Gerhard
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2010, 1 (02) : 119 - 131
[40] Improved Cross-Corpus Speech Emotion Recognition Using Deep Local Domain Adaptation
Zhao Huijuan
Ye Ning
Wang Ruchuan
CHINESE JOURNAL OF ELECTRONICS, 2023, 32 (03) : 640 - 646

← 1 2 3 4 5 →