Multi-task Model for Comic Book Image Analysis

被引：4

作者：

Nhu-Van Nguyen ^{[1
]}

Rigaud, Christophe ^{[1
]}

Burie, Jean-Christophe ^{[1
]}

机构：

[1] Univ La Rochelle, Lab L3i, F-17042 La Rochelle 1, France

来源：

MULTIMEDIA MODELING, MMM 2019, PT II | 2019年 / 11296卷

关键词：

Comic book image analysis; Association balloon-character; Multi-task learning; CNN; Deep learning;

D O I：

10.1007/978-3-030-05716-9_57

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Comic book image analysis methods often propose multiple algorithms or models for multiple tasks like panels and characters detection, balloons segmentation and text recognition, etc. In this work, we aim to reduce the complexity for comic book image analysis by proposing one model which can learn multiple tasks called Comic MTL. In addition to the detection task and segmentation task, we integrate the relation analysis task for balloons and characters into the Comic MTL model. The experiments with our model are carried out on the eBDtheque dataset which contains the annotations for panels, balloons, characters and also the relations balloon-character. We show that the Comic MTL model can detect the association between balloons and their speakers (comic characters) and handle other tasks like panels, characters detection and balloons segmentation with promising results.

引用

页码：637 / 649

页数：13

共 50 条

[21] Multi-task gradient descent for multi-task learning
Bai, Lu
Ong, Yew-Soon
He, Tiantian
Gupta, Abhishek
MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
[22] Automatic Analysis of Transverse Musculoskeletal Ultrasound Images Based on the Multi-Task Learning Model
Zhou, Linxueying
Liu, Shangkun
Zheng, Weimin
ENTROPY, 2023, 25 (04)
[23] Image Recognition of Chinese herbal pieces Based on Multi-task Learning Model
Hu, Ji-Li
Wang, Yong-Kang
Che, Zeng-Yang
Li, Qian-Qian
Jiang, Hong-Kun
Liu, Ling-Jie
2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 1555 - 1559
[24] Multi-modal Sentiment and Emotion Joint Analysis with a Deep Attentive Multi-task Learning Model
Zhang, Yazhou
Rong, Lu
Li, Xiang
Chen, Rui
ADVANCES IN INFORMATION RETRIEVAL, PT I, 2022, 13185 : 518 - 532
[25] Semantic Communication Approach for Multi-Task Image Transmission
Zhang, Zhenguo
Yang, Qianqian
He, Shibo
Shi, Zhiguo
2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
[26] Deep multi-task learning for malware image classification
Bensaoud, Ahmed
Kalita, Jugal
JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
[27] Enhanced representation and multi-task learning for image annotation
Binder, Alexander
Samek, Wojciech
Mueller, Klaus-Robert
Kawanabe, Motoaki
COMPUTER VISION AND IMAGE UNDERSTANDING, 2013, 117 (05) : 466 - 478
[28] Multi-task based Image Aesthetics Quality Evaluation
Jiang, Min
Chen, Zhe
Jiang, Jiajun
Liu, Xiaoming
Hu, Wei
2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2755 - 2760
[29] Learning multi-task local metrics for image annotation
Xu, Xing
Shimada, Atsushi
Nagahara, Hajime
Taniguchi, Rin-ichiro
MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (04) : 2203 - 2231
[30] A Multi-task Method for Immunofixation Electrophoresis Image Classification
Shi, Yi
Li, Rui-Xiang
Shao, Wen-Qi
Duan, Xin-Cen
Ye, Han-Jia
Zhan, De-Chuan
Pan, Bai-Shen
Wang, Bei-Li
Guo, Wei
Jiang, Yuan
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VI, 2023, 14225 : 148 - 158

← 1 2 3 4 5 →