Analyzing the potential of active learning for document image classification

被引：3

作者：

Saifullah, Saifullah ^{[1
,2
]}

Agne, Stefan ^{[1
,3
]}

Dengel, Andreas ^{[1
,2
]}

Ahmed, Sheraz ^{[1
,3
]}

机构：

[1] German Res Ctr Artificial Intelligence, D-67663 Kaiserslautern, Germany

[2] RPTU Kaiserslautern Landau, D-67663 Kaiserslautern, Germany

[3] DeepReader GmbH, D-67663 Kaiserslautern, Germany

来源：

INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION | 2023年 / 26卷 / 03期

关键词：

Document image classification; Document analysis; Active learning; Deep active learning; NEURAL-NETWORKS;

D O I：

10.1007/s10032-023-00429-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, a great deal of emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require huge volumes of annotated data in order to achieve competitive performances. And since data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning to reduce the costs of data annotation in the context of document image classification, which is one of the core components of modern document processing pipelines. The results of this study demonstrate that by utilizing active learning (AL), deep document classification models can achieve competitive performances to the models trained on fully annotated datasets and, in some cases, even surpass them by annotating only 15-40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying, and in many cases achieve comparable performance to the models trained on fully annotated datasets even in the presence of practical deployment issues such as data imbalance, and annotation noise, and thus, offer tremendous benefits in real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at .

引用

页码：187 / 209

页数：23

共 50 条

[1] Analyzing the potential of active learning for document image classification
Saifullah Saifullah
Stefan Agne
Andreas Dengel
Sheraz Ahmed
International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 187 - 209
[2] Analyzing the Potential of Zero-Shot Recognition for Document Image Classification
Siddiqui, Shoaib Ahmed
Dengel, Andreas
Ahmed, Sheraz
DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 293 - 304
[3] Explorations in Active Learning Applied to Image Classification
Klimczak, Adriana
Wenka, Marcel
Ganzha, Maria
Paprzycki, Marcin
BIG DATA ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2022, 2023, 13830 : 17 - 30
[4] Unsupervised Exemplar-Based Learning for Improved Document Image Classification
Abuelwafa, Sherif
Pedersoli, Marco
Cheriet, Mohamed
IEEE ACCESS, 2019, 7 : 133738 - 133748
[5] DEEP ACTIVE LEARNING FOR IMAGE CLASSIFICATION
Ranganathan, Hiranmayi
Venkateswara, Hemanth
Chakraborty, Shayok
Panchanathan, Sethuraman
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, : 3934 - 3938
[6] ACTIVE MANIFOLD LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION
Zhang, Zhou
Taskin, Gulsen
Crawford, Melba M.
IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 2587 - 2590
[7] Scalable Active Learning for Multiclass Image Classification
Joshi, Ajay J.
Porikli, Fatih
Papanikolopoulos, Nikolaos P.
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2259 - 2273
[8] MULTIPLE KERNEL ACTIVE LEARNING FOR IMAGE CLASSIFICATION
Yang, Jingjing
Li, Yuanning
Tian, Yonghong
Duan, Lingyu
Gao, Wen
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 550 - +
[9] Active Learning in Social Context for Image Classification
Chatzilari, Elisavet
Nikolopoulos, Spiros
Kompatsiaris, Yiannis
Kittler, Josef
PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 76 - 85
[10] SALIC: Social Active Learning for Image Classification
Chatzilari, Elisavet
Nikolopoulos, Spiros
Kompatsiaris, Yiannis
Kittler, Josef
IEEE TRANSACTIONS ON MULTIMEDIA, 2016, 18 (08) : 1488 - 1503

← 1 2 3 4 5 →