Multi-task Model for Comic Book Image Analysis

被引:4
|
作者
Nhu-Van Nguyen [1 ]
Rigaud, Christophe [1 ]
Burie, Jean-Christophe [1 ]
机构
[1] Univ La Rochelle, Lab L3i, F-17042 La Rochelle 1, France
来源
MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷
关键词
Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;
D O I
10.1007/978-3-030-05716-9_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.
引用
收藏
页码:637 / 649
页数:13
相关论文
共 50 条
  • [21] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [22] Automatic Analysis of Transverse Musculoskeletal Ultrasound Images Based on the Multi-Task Learning Model
    Zhou, Linxueying
    Liu, Shangkun
    Zheng, Weimin
    ENTROPY, 2023, 25 (04)
  • [23] Image Recognition of Chinese herbal pieces Based on Multi-task Learning Model
    Hu, Ji-Li
    Wang, Yong-Kang
    Che, Zeng-Yang
    Li, Qian-Qian
    Jiang, Hong-Kun
    Liu, Ling-Jie
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1555 - 1559
  • [24] Multi-modal Sentiment and Emotion Joint Analysis with a Deep Attentive Multi-task Learning Model
    Zhang, Yazhou
    Rong, Lu
    Li, Xiang
    Chen, Rui
    ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 518 - 532
  • [25] Semantic Communication Approach for Multi-Task Image Transmission
    Zhang, Zhenguo
    Yang, Qianqian
    He, Shibo
    Shi, Zhiguo
    2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [26] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [27] Enhanced representation and multi-task learning for image annotation
    Binder, Alexander
    Samek, Wojciech
    Mueller, Klaus-Robert
    Kawanabe, Motoaki
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (05) : 466 - 478
  • [28] Multi-task based Image Aesthetics Quality Evaluation
    Jiang, Min
    Chen, Zhe
    Jiang, Jiajun
    Liu, Xiaoming
    Hu, Wei
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2755 - 2760
  • [29] Learning multi-task local metrics for image annotation
    Xu, Xing
    Shimada, Atsushi
    Nagahara, Hajime
    Taniguchi, Rin-ichiro
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (04) : 2203 - 2231
  • [30] A Multi-task Method for Immunofixation Electrophoresis Image Classification
    Shi, Yi
    Li, Rui-Xiang
    Shao, Wen-Qi
    Duan, Xin-Cen
    Ye, Han-Jia
    Zhan, De-Chuan
    Pan, Bai-Shen
    Wang, Bei-Li
    Guo, Wei
    Jiang, Yuan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 148 - 158