Multi-task Model for Comic Book Image Analysis

被引:4
作者
Nhu-Van Nguyen [1 ]
Rigaud, Christophe [1 ]
Burie, Jean-Christophe [1 ]
机构
[1] Univ La Rochelle, Lab L3i, F-17042 La Rochelle 1, France
来源
MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷
关键词
Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;
D O I
10.1007/978-3-030-05716-9_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.
引用
收藏
页码:637 / 649
页数:13
相关论文
共 50 条
  • [41] Spatially Augmented Speech Bubble to Character Association via Comic Multi-task Learning
    Soykan, Gurkan
    Yuret, Deniz
    Sezgin, Tevfik Metin
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024 WORKSHOPS, PT I, 2024, 14935 : 231 - 256
  • [42] Multi-task CNN Model for Action Detection
    Chen, Xin
    Han, Yahong
    2018 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (IEEE VCIP), 2018,
  • [43] Multi-Task Learning for Medical Image Inpainting Based on Organ Boundary Awareness
    Tran, Minh-Trieu
    Kim, Soo-Hyung
    Yang, Hyung-Jeong
    Lee, Guee-Sang
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [44] A Deep Multi-Task Learning Approach for Bioelectrical Signal Analysis
    Medhi, Jishu K.
    Ren, Pusheng
    Hu, Mengsha
    Chen, Xuhui
    MATHEMATICS, 2023, 11 (22)
  • [45] Bacterial image analysis using multi-task deep learning approaches for clinical microscopy
    Chin, Shuang Yee
    Dong, Jian
    Hasikin, Khairunnisa
    Ngui, Romano
    Lai, Khin Wee
    Yeoh, Pauline Shan Qing
    Wu, Xiang
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [46] Adaptive Weight Generator for Multi-Task Image Recognition by Task Grouping Prompt
    Wu, Gaojie
    Zeng, Ling-an
    Meng, Jing-Ke
    Zheng, Wei-Shi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9906 - 9919
  • [47] A multi-task model for failure identification and GPS assessment in metro trains
    Jadhav, Pratik Vinayak
    Sairam, V. A.
    Sonkavade, Siddharth
    Wagle, Shivali Amit
    Pareek, Preksha
    Kotecha, Ketan
    Choudhury, Tanupriya
    AIMS ENVIRONMENTAL SCIENCE, 2024, 11 (06) : 960 - 986
  • [48] A Multi-Task Learning Formulation for Survival Analysis
    Li, Yan
    Wang, Jie
    Ye, Jieping
    Reddy, Chandan K.
    KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1715 - 1724
  • [49] Taxi Demand Prediction Using Parallel Multi-Task Learning Model
    Zhang, Chizhan
    Zhu, Fenghua
    Wang, Xiao
    Sun, Leilei
    Tang, Haina
    Lv, Yisheng
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (02) : 794 - 803
  • [50] Bacterial image analysis using multi-task deep learning approaches for clinical microscopy
    Chin, Shuang Yee
    Dong, Jian
    Hasikin, Khairunnisa
    Ngui, Romano
    Lai, Khin Wee
    Yeoh, Pauline Shan Qing
    Wu, Xiang
    PeerJ Computer Science, 2024, 10