Disentangled Variational Autoencoder for Emotion Recognition in Conversations

被引：5

作者：

Yang, Kailai ^{[1
]}

Zhang, Tianlin ^{[1
]}

Ananiadou, Sophia ^{[1
]}

机构：

[1] Univ Manchester, Dept Comp Sci, NaCTeM, Manchester M13 9PL, England

来源：

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2024年 / 15卷 / 02期

基金：

英国生物技术与生命科学研究理事会;

关键词：

Task analysis; Emotion recognition; Hidden Markov models; Context modeling; Decoding; Oral communication; Gaussian distribution; Emotion recognition in conversations; variational autoencoder; valence-arousal-dominance; disentangled representations; DIALOGUE;

D O I：

10.1109/TAFFC.2023.3280038

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In Emotion Recognition in Conversations (ERC), the emotions of target utterances are closely dependent on their context. Therefore, existing works train the model to generate the response of the target utterance, which aims to recognise emotions leveraging contextual information. However, adjacent response generation ignores long-range dependencies and provides limited affective information in many cases. In addition, most ERC models learn a unified distributed representation for each utterance, which lacks interpretability and robustness. To address these issues, we propose a VAD-disentangled Variational AutoEncoder (VAD-VAE), which first introduces a target utterance reconstruction task based on Variational Autoencoder, then disentangles three affect representations Valence-Arousal-Dominance (VAD) from the latent space. We also enhance the disentangled representations by introducing VAD supervision signals from a sentiment lexicon and minimising the mutual information between VAD distributions. Experiments show that VAD-VAE outperforms the state-of-the-art model on two datasets. Further analysis proves the effectiveness of each proposed module and the quality of disentangled VAD representations.

引用

页码：508 / 518

页数：11

共 50 条

[21] Bimodal variational autoencoder for audiovisual speech recognition
Hadeer M. Sayed
Hesham E. ElDeeb
Shereen A. Taie
Machine Learning, 2023, 112 : 1201 - 1226
[22] Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder
von Hahn, Tim
Mechefske, Chris K.
INTERNATIONAL JOURNAL OF HYDROMECHATRONICS, 2021, 4 (01) : 69 - 98
[23] Hypergraph Neural Network for Emotion Recognition in Conversations
Zheng, Cheng
Xu, Haojie
Sun, Xiao
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
[24] Fusing pairwise modalities for emotion recognition in conversations
Fan, Chunxiao
Lin, Jie
Mao, Rui
Cambria, Erik
INFORMATION FUSION, 2024, 106
[25] Knowing What and Why: Causal emotion entailment for emotion recognition in conversations
Liu, Hao
Wei, Runguo
Tu, Geng
Lin, Jiali
Jiang, Dazhi
Cambria, Erik
EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
[26] A Physically Constrained Variational Autoencoder for Geochemical Pattern Recognition
Yihui Xiong
Renguang Zuo
Zijing Luo
Xueqiu Wang
Mathematical Geosciences, 2022, 54 : 783 - 806
[27] A Physically Constrained Variational Autoencoder for Geochemical Pattern Recognition
Xiong, Yihui
Zuo, Renguang
Luo, Zijing
Wang, Xueqiu
MATHEMATICAL GEOSCIENCES, 2022, 54 (04) : 783 - 806
[28] Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations
Ruan, Yu-Ping
Ling, Zhen-Hua
Zhu, Xiaodan
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (06)
[29] Emotion-Guided Music Accompaniment Generation Based on Variational Autoencoder
Wang, Qi
Zhang, Shubing
Zhou, Li
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[30] DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations
Wu, Xiaolong
Feng, Chang
Xu, Mingxing
Zheng, Thomas Fang
Hamdulla, Askar
IEEE ACCESS, 2023, 11 : 141251 - 141260

← 1 2 3 4 5 →