EmoComicNet: A multi-task model for comic emotion recognition

Cited by: 4
Authors
Dutta, Arpita [1 ,2 ]
Biswas, Samit [1 ]
Das, Amit Kumar [1 ]
Affiliations
[1] Indian Inst Engn Science&Technol, Dept Comp Science&Technol, Howrah 711103, West Bengal, India
[2] Techno Main, Artificial Intelligence & Machine Learning, Dept Comp Sci & Engn, Kolkata 700091, West Bengal, India
Keywords
Comic analysis; Multi-modal emotion recognition; Document image processing; Deep learning; Multi-task learning;
DOI
10.1016/j.patcog.2024.110261
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The emotion and sentiment associated with comic scenes can provide potential information for inferring the context of comic stories, which is an essential prerequisite for developing automatic content understanding tools for comics. Here, we address this open area of comic research by exploiting the multi-modal nature of comics. Multi-modal sentiment analysis methods generally assume that both image and text modalities are always present at the test phase. However, this assumption is not always satisfied for comics, since comic characters' facial expressions, gestures, etc., are not always clearly visible. Also, the underlying context of the dialogues between comic characters is often challenging to comprehend. To deal with these constraints of comic emotion analysis, we propose a multi-task-based framework, namely EmoComicNet, to fuse multi-modal information (i.e., both image and text) if it is available. However, the proposed EmoComicNet is designed to perform even when any modality is weak or completely missing. The proposed method potentially improves the overall performance. Besides, EmoComicNet can also deal with the problem of weak or absent modality during the training phase.
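The idea of fusing image and text features "if available" while still working with one modality absent can be illustrated with a minimal sketch. This is not the EmoComicNet architecture, which the abstract does not specify; it is a generic availability-aware average, and the function name `fuse_modalities` and the plain-list feature vectors are assumptions made purely for illustration.

```python
from typing import List, Optional, Sequence


def fuse_modalities(
    image_feat: Optional[Sequence[float]],
    text_feat: Optional[Sequence[float]],
) -> List[float]:
    """Average whichever modality feature vectors are present.

    Hypothetical helper: a missing modality is passed as None and is
    simply left out of the average, so the output has the same shape
    whether one or both modalities are available.
    """
    present = [f for f in (image_feat, text_feat) if f is not None]
    if not present:
        raise ValueError("at least one modality is required")
    dim = len(present[0])
    if any(len(f) != dim for f in present):
        raise ValueError("feature vectors must share one dimension")
    # Element-wise mean over the modalities that are actually present.
    return [sum(f[i] for f in present) / len(present) for i in range(dim)]


both = fuse_modalities([1.0, 3.0], [3.0, 5.0])   # image and text present
text_only = fuse_modalities(None, [3.0, 5.0])    # image modality missing
```

A learned model would typically replace the plain average with trainable fusion weights; the sketch only shows how a downstream classifier can receive a fixed-size input regardless of which modalities are present.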
Pages: 11
Related papers
50 entries in total
  • [31] Survey on multi-task learning for object classification and recognition
    Li H.
    Wang F.
    Ding W.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (01):
  • [32] SELECTIVE MULTI-TASK LEARNING FOR SPEECH EMOTION RECOGNITION USING CORPORA OF DIFFERENT STYLES
    Zhang, Heran
    Mimura, Masato
    Kawahara, Tatsuya
    Ishizuka, Kenkichi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7707 - 7711
  • [33] LEVERAGING VALENCE AND ACTIVATION INFORMATION VIA MULTI-TASK LEARNING FOR CATEGORICAL EMOTION RECOGNITION
    Xia, Rui
    Liu, Yang
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5301 - 5305
  • [34] Driver multi-task emotion recognition network based on multi-modal facial video analysis
    Xiang, Guoliang
    Yao, Song
    Wu, Xianhui
    Deng, Hanwen
    Wang, Guojie
    Liu, Yu
    Li, Fan
    Peng, Yong
    PATTERN RECOGNITION, 2025, 161
  • [35] Identity, Gender, Age, and Emotion Recognition from Speaker Voice with Multi-task Deep Networks for Cognitive Robotics
    Foggia, Pasquale
    Greco, Antonio
    Roberto, Antonio
    Saggese, Alessia
    Vento, Mario
    COGNITIVE COMPUTATION, 2024, 16 (05) : 2713 - 2723
  • [36] Dual Multi-Task Network with Bridge-Temporal-Attention for Student Emotion Recognition via Classroom Video
    He, Jun
    Peng, Li
    Sun, Bo
    Yu, Lejun
    Guo, Meng
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [37] Adversarial Multi-task Model for Emotion, Sentiment, and Sarcasm Aided Complaint Detection
    Singh, Apoorva
    Nazir, Arousha
    Saha, Sriparna
    ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 428 - 442
  • [38] MTLFuseNet: A novel emotion recognition model based on deep latent feature fusion of EEG signals and multi-task learning
    Li, Rui
    Ren, Chao
    Ge, Yiqing
    Zhao, Qiqi
    Yang, Yikun
    Shi, Yuhan
    Zhang, Xiaowei
    Hu, Bin
    KNOWLEDGE-BASED SYSTEMS, 2023, 276
  • [39] Chinese Named Entity Recognition Model Based on Multi-Task Learning
    Fang, Qin
    Li, Yane
    Feng, Hailin
    Ruan, Yaoping
    APPLIED SCIENCES-BASEL, 2023, 13 (08):
  • [40] Cross-Corpus Speech Emotion Recognition Based on Multi-Task Learning and Subdomain Adaptation
    Fu, Hongliang
    Zhuang, Zhihao
    Wang, Yang
    Huang, Chen
    Duan, Wenzhuo
    ENTROPY, 2023, 25 (01)