Semi-supervised Ladder Networks for Speech Emotion Recognition

被引:0
|
作者
Jian-Hua Tao
Jian Huang
Ya Li
Zheng Lian
Ming-Yue Niu
机构
[1] National Laboratory of Pattern Recognition,School of Artificial Intelligence
[2] University of Chinese Academy of Science (CAS),CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation
[3] Chinese Academy of Sciences,undefined
来源
International Journal of Automation and Computing | 2019年 / 16卷
关键词
Speech emotion recognition; the ladder network; semi-supervised learning; autoencoder; regularization;
D O I
暂无
中图分类号
学科分类号
摘要
As a major component of speech signal processing, speech emotion recognition has become increasingly essential to understanding human communication. Benefitting from deep learning, many researchers have proposed various unsupervised models to extract effective emotional features and supervised models to train emotion recognition systems. In this paper, we utilize semi-supervised ladder networks for speech emotion recognition. The model is trained by minimizing the supervised loss and auxiliary unsupervised cost function. The addition of the unsupervised auxiliary task provides powerful discriminative representations of the input features, and is also regarded as the regularization of the emotional supervised task. We also compare the ladder network with other classical autoencoder structures. The experiments were conducted on the interactive emotional dyadic motion capture (IEMOCAP) database, and the results reveal that the proposed methods achieve superior performance with a small number of labelled data and achieves better performance than other methods.
引用
收藏
页码:437 / 448
页数:11
相关论文
共 50 条
  • [41] A review on semi-supervised learning for EEG-based emotion recognition
    Qiu, Sen
    Chen, Yongtao
    Yang, Yulin
    Wang, Pengfei
    Wang, Zhelong
    Zhao, Hongyu
    Kang, Yuntong
    Nie, Ruicheng
    INFORMATION FUSION, 2024, 104
  • [42] Semi-Supervised Learning for Continuous Emotion Recognition Based on Metric Learning
    Choi, Dong Yoon
    Song, Byung Cheol
    IEEE ACCESS, 2020, 8 : 113443 - 113455
  • [43] Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning
    Sun, Yifu
    Zhang, Xulong
    Wang, Jianzong
    Cheng, Ning
    Hu, Kaiyu
    Xiao, Jing
    INTERSPEECH 2023, 2023, : 5456 - 5460
  • [44] Deep Recurrent Semi-Supervised EEG Representation Learning for Emotion Recognition
    Zhang, Guangyi
    Teinad, Ali, I
    2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2021,
  • [45] S2-VER: Semi-supervised Visual Emotion Recognition
    Jia, Guoli
    Yang, Jufeng
    COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 493 - 509
  • [46] Semi-Supervised Method for Multi-Category Emotion Recognition in Tweets
    Sintsova, Valentina
    Musat, Claudiu
    Pu, Pearl
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 393 - 402
  • [47] Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
    Feng, Tiantian
    Narayanan, Shrikanth
    INTERSPEECH 2022, 2022, : 5050 - 5054
  • [48] Learning ladder neural networks for semi-supervised node classification in social network
    Li, Bentian
    Pi, Dechang
    Lin, Yunxia
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
  • [49] INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION
    Khonglah, Banriskhem
    Madikeri, Srikanth
    Dey, Subhadeep
    Bourlard, Herve
    Motlicek, Petr
    Billa, Jayadev
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7419 - 7423
  • [50] Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
    Zhu H.
    Gao D.
    Cheng G.
    Povey D.
    Zhang P.
    Yan Y.
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2023, 31 : 3320 - 3330