Multi-task Model for Comic Book Image Analysis

被引:4
|
作者
Nhu-Van Nguyen [1 ]
Rigaud, Christophe [1 ]
Burie, Jean-Christophe [1 ]
机构
[1] Univ La Rochelle, Lab L3i, F-17042 La Rochelle 1, France
来源
MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷
关键词
Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;
D O I
10.1007/978-3-030-05716-9_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.
引用
收藏
页码:637 / 649
页数:13
相关论文
共 50 条
  • [1] Comic MTL: optimized multi-task learning for comic book image analysis
    Nhu-Van Nguyen
    Rigaud, Christophe
    Burie, Jean-Christophe
    INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (03) : 265 - 284
  • [2] Comic MTL: optimized multi-task learning for comic book image analysis
    Nhu-Van Nguyen
    Christophe Rigaud
    Jean-Christophe Burie
    International Journal on Document Analysis and Recognition (IJDAR), 2019, 22 : 265 - 284
  • [3] EmoComicNet: A multi-task model for comic emotion recognition
    Dutta, Arpita
    Biswas, Samit
    Das, Amit Kumar
    PATTERN RECOGNITION, 2024, 150
  • [4] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
    Zhang, Wenlu
    Li, Rongjian
    Zeng, Tao
    Sun, Qian
    Kumar, Sudhir
    Ye, Jieping
    Ji, Shuiwang
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 1475 - 1484
  • [5] Deep Model Based Transfer and Multi-Task Learning for Biological Image Analysis
    Zhang, Wenlu
    Li, Rongjian
    Zeng, Tao
    Sun, Qian
    Kumar, Sudhir
    Ye, Jieping
    Ji, Shuiwang
    IEEE TRANSACTIONS ON BIG DATA, 2020, 6 (02) : 322 - 333
  • [6] Breast cancer pathological image classification based on multi-task model
    Yu L.
    Xia Y.
    Wang P.
    Yan Y.
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2021, 49 (08): : 53 - 57
  • [7] Multi-task deep learning for medical image computing and analysis: A review
    Zhao, Yan
    Wang, Xiuying
    Che, Tongtong
    Bao, Guoqing
    Li, Shuyu
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 153
  • [8] Multi-task Deep Learning for Image Understanding
    Yu, Bo
    Lane, Ian
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 37 - 42
  • [9] CubiCasa5K: A Dataset and an Improved Multi-task Model for Floorplan Image Analysis
    Kalervo, Ahti
    Ylioinas, Juha
    Haikio, Markus
    Karhu, Antti
    Kannala, Juho
    IMAGE ANALYSIS, 2019, 11482 : 28 - 40
  • [10] Multi-task Survival Analysis
    Wang, Lu
    Li, Yan
    Zhou, Jiayu
    Zhu, Dongxiao
    Ye, Jieping
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 485 - 494