A Versatile Multimodal Learning Framework for Zero-Shot Emotion Recognition

被引:1
作者
Qi, Fan [1 ]
Zhang, Huaiwen [2 ,3 ]
Yang, Xiaoshan [4 ,5 ,6 ]
Xu, Changsheng [4 ,5 ,6 ]
机构
[1] Tianjin Univ Technol, Sch Comp Sci & Engn, Tianjin 300384, Peoples R China
[2] Inner Mongolia Univ, Coll Comp Sci, Hohhot 010021, Peoples R China
[3] Natl & Local Joint Engn Res Ctr Intelligent Infor, Hohhot 010021, Peoples R China
[4] Chinese Acad Sci, Inst Automat, State Key Lab Multimodal Artificial Intelligence, Beijing 100190, Peoples R China
[5] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100190, Peoples R China
[6] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Multimodal emotion recognition; zero-shot learning; transformer; NETWORKS; MODEL;
D O I
10.1109/TCSVT.2024.3362270
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Multi-modal Emotion Recognition (MER) aims to identify various human emotions from heterogeneous modalities. With the development of emotional theories, there are more and more novel and fine-grained concepts to describe human emotional feelings. Real-world recognition systems often encounter unseen emotion labels. To address this challenge, we propose a versatile zero-shot MER framework to refine emotion label embeddings for capturing inter-label relationships and improving discrimination between labels. We integrate prior knowledge into a novel affective graph space that generates tailored label embeddings capturing inter-label relationships. To obtain multimodal representations, we disentangle the features of each modality into egocentric and altruistic components using adversarial learning. These components are then hierarchically fused using a hybrid co-attention mechanism. Furthermore, an emotion-guided decoder exploits label-modal dependencies to generate adaptive multimodal representations guided by emotion embeddings. We conduct extensive experiments with different multimodal combinations, including visual-acoustic and visual-textual inputs, on four datasets in both single-label and multi-label zero-shot settings. Results demonstrate the superiority of our proposed framework over state-of-the-art methods.
引用
收藏
页码:5728 / 5741
页数:14
相关论文
共 50 条
  • [41] Chinese medical named entity recognition based on zero-shot learning
    Zhou, Menglin
    Gong, Kecun
    2022 5TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING, MLNLP 2022, 2022, : 190 - 195
  • [42] Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes
    Xu, Xinzhou
    Deng, Jun
    Cummins, Nicholas
    Zhang, Zixing
    Zhao, Li
    Schuller, Bjoern W.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 2752 - 2765
  • [43] Self-Assembled Generative Framework for Generalized Zero-Shot Learning
    Gao, Mengyu
    Dong, Qiulei
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 914 - 924
  • [44] Zero-Shot Transfer Learning Framework for Plant Leaf Disease Classification
    Satya Rajendra Singh, R.
    Sanodiya, Rakesh Kumar
    IEEE ACCESS, 2023, 11 : 143861 - 143880
  • [45] Zero-shot Learning via the fusion of generation and embedding for image recognition
    Zhao, Peng
    Zhang, Siying
    Liu, Jinhui
    Liu, Huiting
    INFORMATION SCIENCES, 2021, 578 (578) : 831 - 847
  • [46] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [47] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [48] Semantic-aware visual attributes learning for zero-shot recognition
    Xie, Yurui
    Song, Tiecheng
    Li, Wei
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2021, 74
  • [49] A zero-shot learning framework via cluster-prototype matching
    Zhang, Jing
    Li, Qingyong
    Geng, YangLi-ao
    Wang, Wen
    Sun, Wenju
    Shi, Chuan
    Ding, Zhengming
    PATTERN RECOGNITION, 2022, 124
  • [50] A Generalized Zero-Shot Learning Framework for PolSAR Land Cover Classification
    Gui, Rong
    Xu, Xin
    Wang, Lei
    Yang, Rui
    Pu, Fangling
    REMOTE SENSING, 2018, 10 (08)