Multi-task Model for Comic Book Image Analysis

被引:4
|
作者
Nhu-Van Nguyen [1 ]
Rigaud, Christophe [1 ]
Burie, Jean-Christophe [1 ]
机构
[1] Univ La Rochelle, Lab L3i, F-17042 La Rochelle 1, France
来源
MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷
关键词
Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;
D O I
10.1007/978-3-030-05716-9_57
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.
引用
收藏
页码:637 / 649
页数:13
相关论文
共 50 条
  • [31] Learning multi-task local metrics for image annotation
    Xing Xu
    Atsushi Shimada
    Hajime Nagahara
    Rin-ichiro Taniguchi
    Multimedia Tools and Applications, 2016, 75 : 2203 - 2231
  • [32] DRMNet: A Multi-Task Detection Model Based on Image Processing for Autonomous Driving Scenarios
    Zhao, Jiandong
    Wu, Di
    Yu, Zhixin
    Gao, Ziyou
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (12) : 15341 - 15355
  • [33] Multi-Task CNN Model for Attribute Prediction
    Abdulnabi, Abrar H.
    Wang, Gang
    Lu, Jiwen
    Jia, Kui
    IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) : 1949 - 1959
  • [34] One model is all you need: Multi-task learning enables simultaneous histology image segmentation and classification
    Graham, Simon
    Vu, Quoc Dang
    Jahanifar, Mostafa
    Raza, Shan E. Ahmed
    Minhas, Fayyaz
    Snead, David
    Rajpoot, Nasir
    MEDICAL IMAGE ANALYSIS, 2023, 83
  • [35] Automatic tongue image quality assessment using a multi-task deep learning model
    Xian, Huimin
    Xie, Yanyan
    Yang, Zizhu
    Zhang, Linzi
    Li, Shangxuan
    Shang, Hongcai
    Zhou, Wu
    Zhang, Honglai
    FRONTIERS IN PHYSIOLOGY, 2022, 13
  • [36] A Contrastive Sharing Model for Multi-Task Recommendation
    Bai, Ting
    Xiao, Yudong
    Wu, Bin
    Yang, Guojun
    Yu, Hongyong
    Nie, Jian-Yun
    PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 3239 - 3247
  • [37] Model-Protected Multi-Task Learning
    Liang, Jian
    Liu, Ziqi
    Zhou, Jiayu
    Jiang, Xiaoqian
    Zhang, Changshui
    Wang, Fei
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) : 1002 - 1019
  • [38] INSIGHTS INTO THE BEHAVIOUR OF MULTI-TASK DEEP NEURAL NETWORKS FOR MEDICAL IMAGE SEGMENTATION
    Bienias, Lukasz T.
    Guillamon, Juanjo R.
    Nielsen, Line H.
    Alstrom, Tommy S.
    2019 IEEE 29TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2019,
  • [39] Multi-Task Deep Learning for Image Segmentation Using Recursive Approximation Tasks
    Ke, Rihuan
    Bugeau, Aurelie
    Papadakis, Nicolas
    Kirkland, Mark
    Schuetz, Peter
    Schonlieb, Carola-Bibiane
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 3555 - 3567
  • [40] SIM: an improved few-shot image classification model with multi-task learning
    Guo, Jin
    Li, Wengen
    Guan, Jihong
    Gao, Hang
    Liu, Baobo
    Gong, Lili
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (03)