Semi-supervised Ladder Networks for Speech Emotion Recognition

Authors
Jian-Hua Tao
Jian Huang
Ya Li
Zheng Lian
Ming-Yue Niu
Affiliations
[1] National Laboratory of Pattern Recognition, School of Artificial Intelligence
[2] University of Chinese Academy of Sciences (CAS), CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation
[3] Chinese Academy of Sciences
Source
International Journal of Automation and Computing | 2019 / Vol. 16
Keywords
Speech emotion recognition; ladder network; semi-supervised learning; autoencoder; regularization
DOI
Not available
Abstract
As a major component of speech signal processing, speech emotion recognition has become increasingly essential to understanding human communication. Benefiting from deep learning, many researchers have proposed unsupervised models to extract effective emotional features and supervised models to train emotion recognition systems. In this paper, we apply semi-supervised ladder networks to speech emotion recognition. The model is trained by minimizing a supervised loss together with an auxiliary unsupervised cost function. The unsupervised auxiliary task yields strongly discriminative representations of the input features and also acts as a regularizer for the supervised emotion recognition task. We also compare the ladder network with other classical autoencoder structures. The experiments were conducted on the Interactive Emotional Dyadic Motion Capture (IEMOCAP) database, and the results show that the proposed method achieves superior performance with a small amount of labelled data and outperforms the other methods.
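The training objective described in the abstract, a supervised loss plus an auxiliary unsupervised cost, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes the general ladder-network formulation, in which the unsupervised cost is a weighted sum of per-layer squared errors between clean encoder activations and the decoder's denoised reconstructions. The function names, the per-layer weights `layer_weights`, and the toy shapes are all hypothetical.

```python
import numpy as np


def softmax(logits):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def supervised_loss(logits, labels):
    """Mean cross-entropy over the labelled examples only."""
    probs = softmax(logits)
    picked = probs[np.arange(len(labels)), labels]
    return float(-np.log(picked).mean())


def unsupervised_cost(clean_acts, denoised_acts, layer_weights):
    """Weighted sum of per-layer mean squared reconstruction errors.

    clean_acts and denoised_acts are lists with one array per layer;
    layer_weights holds one (assumed) scalar weight per layer.
    """
    total = 0.0
    for z, z_hat, lam in zip(clean_acts, denoised_acts, layer_weights):
        total += lam * float(np.mean((z - z_hat) ** 2))
    return total


def total_cost(logits, labels, clean_acts, denoised_acts, layer_weights):
    """Combined objective: supervised loss plus auxiliary unsupervised cost."""
    return supervised_loss(logits, labels) + unsupervised_cost(
        clean_acts, denoised_acts, layer_weights
    )


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    logits = rng.normal(size=(8, 4))        # 8 labelled utterances, 4 emotion classes
    labels = rng.integers(0, 4, size=8)
    clean = [rng.normal(size=(32, 16))]     # one layer, 32 utterances (labelled or not)
    denoised = [clean[0] + 0.1 * rng.normal(size=(32, 16))]
    print(total_cost(logits, labels, clean, denoised, [0.5]))
```

Because the reconstruction term needs no labels, it can be computed on every utterance, which is how unlabelled data contributes to training in this kind of setup.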
Pages: 437-448
Page count: 11
Related papers
50 items in total
  • [1] Semi-supervised Ladder Networks for Speech Emotion Recognition
    Tao, Jian-Hua
    Huang, Jian
    Li, Ya
    Lian, Zheng
    Niu, Ming-Yue
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2019, 16 (04) : 437 - 448
  • [2] Semi-Supervised Speech Emotion Recognition With Ladder Networks
    Parthasarathy, Srinivas
    Busso, Carlos
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2697 - 2709
  • [3] Semi-Supervised Speech Emotion Recognition with Ladder Networks
    Parthasarathy, Srinivas
    Busso, Carlos
    IEEE/ACM Transactions on Audio Speech and Language Processing, 2020, 28 : 2697 - 2709
  • [4] Semi-supervised Ladder Networks for Speech Emotion Recognition
    Jian-Hua Tao
    Jian Huang
    Ya Li
    Zheng Lian
    Ming-Yue Niu
    International Journal of Automation and Computing, 2019, (04) : 437 - 448
  • [5] Correction to: Semi-supervised Ladder Networks for Speech Emotion Recognition
    Jian-Hua Tao
    Jian Huang
    Ya Li
    Zheng Lian
    Ming-Yue Niu
    International Journal of Automation and Computing, 2021, 18 : 680 - 680
  • [6] Speech Emotion Recognition Using Semi-supervised Learning with Ladder Networks
    Huang, Jian
    Li, Ya
    Tao, Jianhua
    Lian, Zheng
    Niu, Mingyue
    Yi, Jiangyan
    2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018
  • [7] Semi-supervised Ladder Networks for Speech Emotion Recognition (vol 16, pg 437, 2019)
    Tao, Jian-Hua
    Huang, Jian
    Li, Ya
    Lian, Zheng
    Niu, Ming-Yue
    INTERNATIONAL JOURNAL OF AUTOMATION AND COMPUTING, 2021, 18 (04) : 680 - 680
  • [8] Semi-supervised Model for Emotion Recognition in Speech
    Pereira, Ingryd
    Santos, Diego
    Maciel, Alexandre
    Barros, Pablo
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT I, 2018, 11139 : 791 - 800
  • [9] Semi-supervised Phoneme Recognition with Recurrent Ladder Networks
    Tietz, Marian
    Alpay, Tayfun
    Twiefel, Johannes
    Wermter, Stefan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2017, PT I, 2017, 10613 : 3 - 10
  • [10] Speech emotion recognition using semi-supervised discriminant analysis
    Zhao, L. (zhaoli@seu.edu.cn), 1600, Southeast University (30):