Disentangled Variational Autoencoder for Emotion Recognition in Conversations

被引:5
|
作者
Yang, Kailai [1 ]
Zhang, Tianlin [1 ]
Ananiadou, Sophia [1 ]
机构
[1] Univ Manchester, Dept Comp Sci, NaCTeM, Manchester M13 9PL, England
基金
英国生物技术与生命科学研究理事会;
关键词
Task analysis; Emotion recognition; Hidden Markov models; Context modeling; Decoding; Oral communication; Gaussian distribution; Emotion recognition in conversations; variational autoencoder; valence-arousal-dominance; disentangled representations; DIALOGUE;
D O I
10.1109/TAFFC.2023.3280038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In Emotion Recognition in Conversations (ERC), the emotions of target utterances are closely dependent on their context. Therefore, existing works train the model to generate the response of the target utterance, which aims to recognise emotions leveraging contextual information. However, adjacent response generation ignores long-range dependencies and provides limited affective information in many cases. In addition, most ERC models learn a unified distributed representation for each utterance, which lacks interpretability and robustness. To address these issues, we propose a VAD-disentangled Variational AutoEncoder (VAD-VAE), which first introduces a target utterance reconstruction task based on Variational Autoencoder, then disentangles three affect representations Valence-Arousal-Dominance (VAD) from the latent space. We also enhance the disentangled representations by introducing VAD supervision signals from a sentiment lexicon and minimising the mutual information between VAD distributions. Experiments show that VAD-VAE outperforms the state-of-the-art model on two datasets. Further analysis proves the effectiveness of each proposed module and the quality of disentangled VAD representations.
引用
收藏
页码:508 / 518
页数:11
相关论文
共 50 条
  • [21] Bimodal variational autoencoder for audiovisual speech recognition
    Hadeer M. Sayed
    Hesham E. ElDeeb
    Shereen A. Taie
    Machine Learning, 2023, 112 : 1201 - 1226
  • [22] Self-supervised learning for tool wear monitoring with a disentangled-variational-autoencoder
    von Hahn, Tim
    Mechefske, Chris K.
    INTERNATIONAL JOURNAL OF HYDROMECHATRONICS, 2021, 4 (01) : 69 - 98
  • [23] Hypergraph Neural Network for Emotion Recognition in Conversations
    Zheng, Cheng
    Xu, Haojie
    Sun, Xiao
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2024, 23 (02)
  • [24] Fusing pairwise modalities for emotion recognition in conversations
    Fan, Chunxiao
    Lin, Jie
    Mao, Rui
    Cambria, Erik
    INFORMATION FUSION, 2024, 106
  • [25] Knowing What and Why: Causal emotion entailment for emotion recognition in conversations
    Liu, Hao
    Wei, Runguo
    Tu, Geng
    Lin, Jiali
    Jiang, Dazhi
    Cambria, Erik
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 274
  • [26] A Physically Constrained Variational Autoencoder for Geochemical Pattern Recognition
    Yihui Xiong
    Renguang Zuo
    Zijing Luo
    Xueqiu Wang
    Mathematical Geosciences, 2022, 54 : 783 - 806
  • [27] A Physically Constrained Variational Autoencoder for Geochemical Pattern Recognition
    Xiong, Yihui
    Zuo, Renguang
    Luo, Zijing
    Wang, Xueqiu
    MATHEMATICAL GEOSCIENCES, 2022, 54 (04) : 783 - 806
  • [28] Condition-Transforming Variational Autoencoder for Generating Diverse Short Text Conversations
    Ruan, Yu-Ping
    Ling, Zhen-Hua
    Zhu, Xiaodan
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2020, 19 (06)
  • [29] Emotion-Guided Music Accompaniment Generation Based on Variational Autoencoder
    Wang, Qi
    Zhang, Shubing
    Zhou, Li
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [30] DialoguePCN: Perception and Cognition Network for Emotion Recognition in Conversations
    Wu, Xiaolong
    Feng, Chang
    Xu, Mingxing
    Zheng, Thomas Fang
    Hamdulla, Askar
    IEEE ACCESS, 2023, 11 : 141251 - 141260