Enhancing Emotion Recognition in Conversation with Dialogue Discourse Structure and Commonsense Knowledge

被引:0
作者
Hao, Jiawang [1 ]
Kong, Fang [1 ]
Kang, Junjun [1 ]
机构
[1] Soochow Univ, Lab Nat Language Proc, Suzhou, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT IV, ICIC 2024 | 2024年 / 14878卷
基金
中国国家自然科学基金;
关键词
Emotion Recognition; Graph Neural Network; Discourse Structure; Knowledge Graph; Commonsense Knowledge;
D O I
10.1007/978-981-97-5672-8_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion Recognition in Conversations (ERC) is the task of identifying the emotions of utterances from speakers in a conversation, which is beneficial to many applications. In this paper, we introduce two kinds of external knowledge, i.e., dialogue discourse structure and social commonsense knowledge implied in dialogue to enhance representation and emotional reasoning. The dialogue discourse structure directly reveals the adjacent or long-distance dependencies between utterances and provides prior knowledge for the semantic interaction between utterances. Implicit commonsense knowledge in utterances can serve as emotional inference cues to model deeper inter-utterance emotional interactions. Specifically, we construct a discourse structure and commonsense knowledge enhanced graph structure over the conversation and use graph convolutional networks to incorporate historical context and commonsense knowledge for utterances. Experimental results show that incorporating discourse structure and commonsense knowledge can effectively improve the performance of the model.
引用
收藏
页码:257 / 268
页数:12
相关论文
共 30 条
  • [1] Asher N, 2016, LREC 2016 - TENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P2721
  • [2] Bosselut A, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P4762
  • [3] IEMOCAP: interactive emotional dyadic motion capture database
    Busso, Carlos
    Bulut, Murtaza
    Lee, Chi-Chun
    Kazemzadeh, Abe
    Mower, Emily
    Kim, Samuel
    Chang, Jeannette N.
    Lee, Sungbok
    Narayanan, Shrikanth S.
    [J]. LANGUAGE RESOURCES AND EVALUATION, 2008, 42 (04) : 335 - 359
  • [4] Affect-GCN: a multimodal graph convolutional network for multi-emotion with intensity recognition and sentiment analysis in dialogues
    Firdaus, Mauajama
    Singh, Gopendra Vikram
    Ekbal, Asif
    Bhattacharyya, Pushpak
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) : 43251 - 43272
  • [5] Ghosal D, 2020, Arxiv, DOI arXiv:2010.02795
  • [6] Ghosal D, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P154
  • [7] Ishiwatari T, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P7360
  • [8] Jiao WX, 2019, Arxiv, DOI arXiv:1904.04446
  • [9] DialogueTRGAT: Temporal and Relational Graph Attention Network for Emotion Recognition in Conversations
    Kang, Junjun
    Kong, Fang
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 460 - 472
  • [10] Kumar A, 2015, INT CONF CONTEMP, P285, DOI 10.1109/IC3.2015.7346694