Semi-supervised Ladder Networks for Speech Emotion Recognition

被引：0

作者：

Jian-Hua Tao

Jian Huang

Ya Li

Zheng Lian

Ming-Yue Niu

机构：

[1] National Laboratory of Pattern Recognition,School of Artificial Intelligence

[2] University of Chinese Academy of Science (CAS),CAS Center for Excellence in Brain Science and Intelligence Technology, Institute of Automation

[3] Chinese Academy of Sciences,undefined

来源：

International Journal of Automation and Computing | 2019年 / 16卷

关键词：

Speech emotion recognition; the ladder network; semi-supervised learning; autoencoder; regularization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

As a major component of speech signal processing, speech emotion recognition has become increasingly essential to understanding human communication. Benefitting from deep learning, many researchers have proposed various unsupervised models to extract effective emotional features and supervised models to train emotion recognition systems. In this paper, we utilize semi-supervised ladder networks for speech emotion recognition. The model is trained by minimizing the supervised loss and auxiliary unsupervised cost function. The addition of the unsupervised auxiliary task provides powerful discriminative representations of the input features, and is also regarded as the regularization of the emotional supervised task. We also compare the ladder network with other classical autoencoder structures. The experiments were conducted on the interactive emotional dyadic motion capture (IEMOCAP) database, and the results reveal that the proposed methods achieve superior performance with a small number of labelled data and achieves better performance than other methods.

引用

页码：437 / 448

页数：11

共 50 条

[41] A review on semi-supervised learning for EEG-based emotion recognition
Qiu, Sen
Chen, Yongtao
Yang, Yulin
Wang, Pengfei
Wang, Zhelong
Zhao, Hongyu
Kang, Yuntong
Nie, Ruicheng
INFORMATION FUSION, 2024, 104
[42] Semi-Supervised Learning for Continuous Emotion Recognition Based on Metric Learning
Choi, Dong Yoon
Song, Byung Cheol
IEEE ACCESS, 2020, 8 : 113443 - 113455
[43] Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning
Sun, Yifu
Zhang, Xulong
Wang, Jianzong
Cheng, Ning
Hu, Kaiyu
Xiao, Jing
INTERSPEECH 2023, 2023, : 5456 - 5460
[44] Deep Recurrent Semi-Supervised EEG Representation Learning for Emotion Recognition
Zhang, Guangyi
Teinad, Ali, I
2021 9TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2021,
[45] S2-VER: Semi-supervised Visual Emotion Recognition
Jia, Guoli
Yang, Jufeng
COMPUTER VISION, ECCV 2022, PT XXXVII, 2022, 13697 : 493 - 509
[46] Semi-Supervised Method for Multi-Category Emotion Recognition in Tweets
Sintsova, Valentina
Musat, Claudiu
Pu, Pearl
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 393 - 402
[47] Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
Feng, Tiantian
Narayanan, Shrikanth
INTERSPEECH 2022, 2022, : 5050 - 5054
[48] Learning ladder neural networks for semi-supervised node classification in social network
Li, Bentian
Pi, Dechang
Lin, Yunxia
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[49] INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION
Khonglah, Banriskhem
Madikeri, Srikanth
Dey, Subhadeep
Bourlard, Herve
Motlicek, Petr
Billa, Jayadev
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7419 - 7423
[50] Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition
Zhu H.
Gao D.
Cheng G.
Povey D.
Zhang P.
Yan Y.
IEEE/ACM Transactions on Audio Speech and Language Processing, 2023, 31 : 3320 - 3330

← 1 2 3 4 5 →