Multi-label, multi-task CNN approach for context-based emotion recognition

Cited by: 36
Authors
Bendjoudi, Ilyes [1]
Vanderhaegen, Frederic [1]
Hamad, Denis [2]
Dornaika, Fadi [3]
Affiliations
[1] Univ Polytech Hauts France, LAMIH UMR 8201, F-59313 Le Mont Houy 9, Valenciennes, France
[2] Univ Littoral Cote d'Opale, LISIC, BP 719, 50 Rue Ferdinand Buisson, F-62228 Calais, France
[3] Univ Basque Country, Manuel Lardizabal 1, San Sebastian 20018, Spain
Keywords
Emotion recognition; Loss function; Multi-task machine learning; Deep learning; Unbalanced data; Facial expression recognition
DOI
10.1016/j.inffus.2020.11.007
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper proposes a new deep learning architecture for context-based, multi-label, multi-task emotion recognition. The architecture is built from three main modules: (1) a body feature extraction module, which is a pre-trained Xception network; (2) a scene feature extraction module, based on a modified VGG16 network; and (3) a fusion-decision module. Moreover, three categorical and three continuous loss functions are compared to highlight the importance of the synergy between loss functions in multi-task learning. We then propose a new loss function, the multi-label focal loss (MFL), based on the focal loss, to deal with imbalanced data. Experimental results on the EMOTIC dataset show that MFL combined with the Huber loss gave better results than any other combination and outperformed the current state of the art on the less frequent labels.
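The abstract does not state the MFL formula, so the following is only a minimal sketch of the general idea it describes: a focal-style loss applied independently to each label for the categorical task, added to a Huber loss for the continuous task. The per-label form follows the standard binary focal loss; the function names, the `gamma`, `alpha`, and `delta` defaults, and the `weight` balancing term are assumptions for illustration, not the authors' settings.

```python
import numpy as np

def multilabel_focal_loss(probs, targets, gamma=2.0, alpha=0.25, eps=1e-7):
    """Binary focal loss applied per label, averaged over samples and labels.

    probs:   predicted per-label probabilities (after a sigmoid), shape (N, L)
    targets: 0/1 label matrix, shape (N, L)
    NOTE: illustrative formulation only; the paper's exact MFL may differ.
    """
    probs = np.clip(probs, eps, 1.0 - eps)
    # p_t is the probability the model assigns to the true state of each label.
    p_t = np.where(targets == 1, probs, 1.0 - probs)
    alpha_t = np.where(targets == 1, alpha, 1.0 - alpha)
    # (1 - p_t)^gamma down-weights labels that are already well classified.
    return float(np.mean(-alpha_t * (1.0 - p_t) ** gamma * np.log(p_t)))

def huber_loss(pred, target, delta=1.0):
    """Huber loss: quadratic for small errors, linear for large ones."""
    err = np.abs(pred - target)
    quadratic = 0.5 * err ** 2
    linear = delta * (err - 0.5 * delta)
    return float(np.mean(np.where(err <= delta, quadratic, linear)))

def combined_loss(probs, labels, cont_pred, cont_true, weight=1.0):
    """Sum of the categorical and continuous task losses (weight is hypothetical)."""
    return multilabel_focal_loss(probs, labels) + weight * huber_loss(cont_pred, cont_true)
```

With `gamma=0` and `alpha=0.5` the categorical term reduces to half the ordinary binary cross-entropy, which is a convenient sanity check; increasing `gamma` shifts the loss toward hard, often rarer, labels.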
Pages: 422-428
Number of pages: 7
Related papers
50 in total
[21]   A Multi-Task Neural Approach for Emotion Attribution, Classification, and Summarization [J].
Tu, Guoyun ;
Fu, Yanwei ;
Li, Boyang ;
Gao, Jiarui ;
Jiang, Yu-Gang ;
Xue, Xiangyang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2020, 22 (01) :148-159
[22]   Facial Action Unit Detection with Multilayer Fused Multi-Task and Multi-Label Deep Learning Network [J].
He, Jun ;
Li, Dongliang ;
Bo, Sun ;
Yu, Lejun .
KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2019, 13 (11) :5546-5559
[23]   Discretized Continuous Speech Emotion Recognition with Multi-Task Deep Recurrent Neural Network [J].
Duc Le ;
Aldeneh, Zakaria ;
Provost, Emily Mower .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :1108-1112
[24]   Multi-label emotion classification of Urdu tweets [J].
Ashraf, Noman ;
Khan, Lal ;
Butt, Sabur ;
Chang, Hsien-Tsung ;
Sidorov, Grigori ;
Gelbukh, Alexander .
PEERJ COMPUTER SCIENCE, 2022, 8
[25]   Multi-Label Facial Emotion Recognition Using Korean Drama Video Clips [J].
Cho, Heeryon ;
Kang, Woo Kyu ;
Park, Youn-Soo ;
Chae, Sun Geu ;
Kim, Seong-joon .
2022 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (IEEE BIGCOMP 2022), 2022, :215-221
[26]   A multi-task CNN approach for lung nodule malignancy classification and characterization [J].
Marques, Sonia ;
Schiavo, Filippo ;
Ferreira, Carlos A. ;
Pedrosa, Joao ;
Cunha, Antonio ;
Campilho, Aurelio .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 184 (184)
[27]   Choosing the right loss function for multi-label Emotion Classification [J].
Hurtado, Lluis-E ;
Gonzalez, Jose-Angel ;
Pla, Ferran .
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 36 (05) :4697-4708
[28]   Fusing Emoji Emotion Distribution for Multi-Label Emotion Classification [J].
Chen, Sufen ;
Liu, Ye ;
Zeng, Xueqiang .
2024 6TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING, ICNLP 2024, 2024, :129-133
[29]   CROSS-CORPUS ACOUSTIC EMOTION RECOGNITION FROM SINGING AND SPEAKING: A MULTI-TASK LEARNING APPROACH [J].
Zhang, Biqiao ;
Provost, Emily Mower ;
Essl, Georg .
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :5805-5809
[30]   MLDT: Multi-task Learning with Denoising Transformer for Gait Identity and Emotion Recognition [J].
Sheng, Weijie ;
Lu, Xiaoyan ;
Li, Xinde .
AICCC 2021: 2021 4TH ARTIFICIAL INTELLIGENCE AND CLOUD COMPUTING CONFERENCE, 2021, :47-52