EmoComicNet: A multi-task model for comic emotion recognition

Cited by: 4
Authors
Dutta, Arpita [1, 2]
Biswas, Samit [1]
Das, Amit Kumar [1]
Affiliations
[1] Indian Inst Engn Science&Technol, Dept Comp Science&Technol, Howrah 711103, West Bengal, India
[2] Techno Main, Artificial Intelligence & Machine Learning, Dept Comp Sci & Engn, Kolkata 700091, West Bengal, India
Keywords
Comic analysis; Multi-modal emotion recognition; Document image processing; Deep learning; Multi-task learning;
DOI
10.1016/j.patcog.2024.110261
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The emotion and sentiment associated with comic scenes can provide potentially useful information for inferring the context of comic stories, which is an essential prerequisite for developing automatic content understanding tools for comics. Here, we address this open area of comic research by exploiting the multi-modal nature of comics. Multi-modal sentiment analysis methods generally assume that both image and text modalities are always present at the test phase. However, this assumption is not always satisfied for comics, since comic characters' facial expressions, gestures, etc., are not always clearly visible. Also, the underlying context is often challenging to comprehend from the dialogues between comic characters. To deal with these constraints of comic emotion analysis, we propose a multi-task-based framework, namely EmoComicNet, that fuses multi-modal information (i.e., both image and text) when it is available. EmoComicNet is also designed to perform even when any modality is weak or completely missing. The proposed method potentially improves the overall performance. Besides, EmoComicNet can also deal with the problem of weak or absent modality during the training phase.
Pages: 11