A Contextual Attention Network for Multimodal Emotion Recognition in Conversation

Cited by: 4
Authors
Wang, Tana [1]
Hou, Yaqing [1]
Zhou, Dongsheng [2]
Zhang, Qiang [1]
Affiliations
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Univ, Key Lab Adv Design & Intelligent Comp, Minist Educ, Dalian, Peoples R China
Source
2021 International Joint Conference on Neural Networks (IJCNN) | 2021
Funding
National Natural Science Foundation of China
Keywords
neural computing; emotion recognition in dyadic conversation; multimodal fusion network; attention mechanism
DOI
10.1109/IJCNN52387.2021.9533718
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Emotion recognition in conversation (ERC) is a challenging task due to the complexity of emotions and the dynamics of dialogues. Current studies on emotion recognition mostly focus on modeling a single utterance in a dialogue, which neglects self- and inter-speaker influence. This paper presents a contextual attention neural network, built on a multimodal framework, that leverages conversational information from both the target speaker and the other speaker for utterance-level emotion detection. Specifically, we utilize recurrent neural networks based on contextual attention to model the transaction and dependence between speakers. Further, feature fusion is proposed to unite the important information extracted from multiple modalities, including audio, text and video, hence providing more useful and comprehensive knowledge for emotion recognition. The proposed approach shows its strength in extracting contexts for self- and inter-speaker influence and synthesizing them into global features that are beneficial for detecting an individual's emotional state. Experimental results on the IEMOCAP corpus show an accuracy of 64.6%, demonstrating the superiority of the proposed method over state-of-the-art approaches.
Pages: 7