Cluster-Level Contrastive Learning for Emotion Recognition in Conversations

被引:28
作者
Yang, Kailai [1 ,2 ]
Zhang, Tianlin [1 ,2 ]
Alhuzali, Hassan [3 ]
Ananiadou, Sophia [1 ,2 ]
机构
[1] Univ Manchester, NaCTeM, Manchester M13 9PL, England
[2] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, England
[3] Umm Al Qura Univ, Coll Comp & Informat Syst, Mecca 24382, Saudi Arabia
基金
英国生物技术与生命科学研究理事会;
关键词
Emotion recognition; Prototypes; Linguistics; Task analysis; Semantics; Training; Adaptation models; Cluster-level contrastive learning; emotion recognition in conversations; pre-trained knowledge adapters; valence-arousal-dominance; DIALOGUE; FRAMEWORK;
D O I
10.1109/TAFFC.2023.3243463
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A key challenge for Emotion Recognition in Conversations (ERC) is to distinguish semantically similar emotions. Some works utilise Supervised Contrastive Learning (SCL) which uses categorical emotion labels as supervision signals and contrasts in high-dimensional semantic space. However, categorical labels fail to provide quantitative information between emotions. ERC is also not equally dependent on all embedded features in the semantic space, which makes the high-dimensional SCL inefficient. To address these issues, we propose a novel low-dimensional Supervised Cluster-level Contrastive Learning (SCCL) method, which first reduces the high-dimensional SCL space to a three-dimensional affect representation space Valence-Arousal-Dominance (VAD), then performs cluster-level contrastive learning to incorporate measurable emotion prototypes. To help modelling the dialogue and enriching the context, we leverage the pre-trained knowledge adapters to infuse linguistic and factual knowledge. Experiments show that our method achieves new state-of-the-art results with 69.81% on IEMOCAP, 65.7% on MELD, and 62.51% on DailyDialog datasets. The analysis also proves that the VAD space is not only suitable for ERC but also interpretable, with VAD prototypes enhancing its performance and stabilising the training of SCCL. In addition, the pre-trained knowledge adapters benefit the performance of the utterance encoder and SCCL. Our code is available at: https://github.com/SteveKGYang/SCCLI
引用
收藏
页码:3269 / 3280
页数:12
相关论文
共 50 条
[41]   Masked Graph Learning With Recurrent Alignment for Multimodal Emotion Recognition in Conversation [J].
Meng, Tao ;
Zhang, Fuchen ;
Shou, Yuntao ;
Shao, Hongen ;
Ai, Wei ;
Li, Keqin .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 :4298-4312
[42]   Federated Multidomain Learning With Graph Ensemble Autoencoder GMM for Emotion Recognition [J].
Zhang, Chunjiong ;
Li, Mingyong ;
Wu, Di .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (07) :7631-7641
[43]   Multi-Classifier Interactive Learning for Ambiguous Speech Emotion Recognition [J].
Zhou, Ying ;
Liang, Xuefeng ;
Gu, Yu ;
Yin, Yifei ;
Yao, Longshan .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 :695-705
[44]   Unsupervised Learning in Reservoir Computing for EEG-Based Emotion Recognition [J].
Fourati, Rahma ;
Ammar, Boudour ;
Sanchez-Medina, Javier ;
Alimi, Adel M. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2022, 13 (02) :972-984
[45]   A Transformer-Based Model With Self-Distillation for Multimodal Emotion Recognition in Conversations [J].
Ma, Hui ;
Wang, Jian ;
Lin, Hongfei ;
Zhang, Bo ;
Zhang, Yijia ;
Xu, Bo .
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 :776-788
[46]   Negative Selection by Clustering for Contrastive Learning in Human Activity Recognition [J].
Wang, Jinqiang ;
Zhu, Tao ;
Chen, Liming Luke ;
Ning, Huansheng ;
Wan, Yaping .
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (12) :10833-10844
[47]   Improving Representation With Hierarchical Contrastive Learning for Emotion-Cause Pair Extraction [J].
Hu, Guimin ;
Zhao, Yi ;
Lu, Guangming .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2024, 15 (04) :1997-2011
[48]   Advancing Imbalanced Domain Adaptation: Cluster-Level Discrepancy Minimization With a Comprehensive Benchmark [J].
Yang, Jianfei ;
Yang, Jiangang ;
Wang, Shizheng ;
Cao, Shuxin ;
Zou, Han ;
Xie, Lihua .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (02) :1106-1117
[49]   Dual-Level Contrastive Learning for Improving Conciseness of Summarization [J].
Peng, Wei ;
Zhang, Han ;
Jiang, Dan ;
Xiao, Kejing ;
Li, Yuxuan .
IEEE ACCESS, 2024, 12 :65630-65639
[50]   Multitask Learning From Augmented Auxiliary Data for Improving Speech Emotion Recognition [J].
Latif, Siddique ;
Rana, Rajib ;
Khalifa, Sara ;
Jurdak, Raja ;
Schuller, Bjorn W. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) :3164-3176