Transferable discriminant linear regression for cross-corpus speech emotion recognition

被引:7
|
作者
Li, Shaokai [1 ]
Song, Peng [1 ]
Zhang, Wenjing [1 ]
机构
[1] Yantai Univ, Sch Comp & Control Engn, Yantai 264005, Peoples R China
基金
中国国家自然科学基金;
关键词
Linear regression; Speech emotion recognition; Category space; Transfer learning; LEAST-SQUARES REGRESSION; GENERAL FRAMEWORK; FEATURES; REGULARIZATION; CLASSIFICATION; ADAPTATION; DATABASES;
D O I
10.1016/j.apacoust.2022.108919
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech emotion recognition (SER) has attracted much interest recently due to its wide applications. However, it should be noted that most SER methods are conducted on the assumption that the training and testing data are from the same database. In real applications, this assumption does not hold, and the recognition performance will be significantly degraded. To solve this problem, we present a novel trans-ferable discriminant linear regression (TDLR) approach for cross-corpus SER. Specifically, first, we intro-duce a non-negative label relaxation linear regression on source corpus to help learn transferable feature representations. Second, we propose a simple but effective strategy to keep the linear relationship between the labels of source and target corpora. Meanwhile, we utilize the discriminative maximum mean discrepancy (MMD) as the distance metric between two databases. Furthermore, we use the graph Laplacian to preserve the geometric structure of samples, which can further reduce the distribution gap between the two databases. Additionally, to better obtain the intrinsic properties of data and make the model robust, we impose an '2;1-norm on the transformation matrices. Extensive experiments have been carried out on several standard databases, and the results show that TDLR can obtain better recognition performance than several state-of-the-art algorithms. (C) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition
    Zhang, Weijian
    Song, Peng
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 307 - 318
  • [2] A CROSS-CORPUS STUDY ON SPEECH EMOTION RECOGNITION
    Milner, Rosanna
    Jalal, Md Asif
    Ng, Raymond W. M.
    Hain, Thomas
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 304 - 311
  • [3] Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition
    Song, Peng
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (02) : 265 - 275
  • [4] Deep Transductive Transfer Regression Network for Cross-Corpus Speech Emotion Recognition
    Zhao, Yan
    Wang, Jincen
    Ye, Ru
    Zong, Yuan
    Zheng, Wenming
    Zhao, Li
    INTERSPEECH 2022, 2022, : 371 - 375
  • [5] CROSS-CORPUS SPEECH EMOTION RECOGNITION USING JOINT DISTRIBUTION ADAPTIVE REGRESSION
    Zhang, Jiacheng
    Jiang, Lin
    Zong, Yuan
    Zheng, Wenming
    Zhao, Li
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3790 - 3794
  • [6] Cross-Corpus Speech Emotion Recognition Based on Joint Transfer Subspace Learning and Regression
    Zhang, Weijian
    Song, Peng
    Chen, Dongliang
    Sheng, Chao
    Zhang, Wenjing
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) : 588 - 598
  • [7] Cross-corpus Speech Emotion Recognition Using Transfer Semi-supervised Discriminant Analysis
    Song, Peng
    Zhang, Xinran
    Ou, Shifeng
    Liu, Jingjing
    Yu, Yanwei
    Zheng, Wenming
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [8] A STUDY ON CROSS-CORPUS SPEECH EMOTION RECOGNITION AND DATA AUGMENTATION
    Braunschweiler, Norbert
    Doddipatla, Rama
    Keizer, Simon
    Stoyanchev, Svetlana
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 24 - 30
  • [9] Cross-Corpus Speech Emotion Recognition Based on Causal Emotion Information Representation
    Fu, Hongliang
    Li, Qianqian
    Tao, Huawei
    Zhu, Chunhua
    Xie, Yue
    Guo, Ruxue
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (08) : 1097 - 1100
  • [10] Implicitly Aligning Joint Distributions for Cross-Corpus Speech Emotion Recognition
    Lu, Cheng
    Zong, Yuan
    Tang, Chuangao
    Lian, Hailun
    Chang, Hongli
    Zhu, Jie
    Li, Sunan
    Zhao, Yan
    ELECTRONICS, 2022, 11 (17)