Analyzing the potential of active learning for document image classification

Cited by: 3
Authors
Saifullah, Saifullah [1, 2]
Agne, Stefan [1, 3]
Dengel, Andreas [1, 2]
Ahmed, Sheraz [1, 3]
Affiliations
[1] German Res Ctr Artificial Intelligence, D-67663 Kaiserslautern, Germany
[2] RPTU Kaiserslautern Landau, D-67663 Kaiserslautern, Germany
[3] DeepReader GmbH, D-67663 Kaiserslautern, Germany
Keywords
Document image classification; Document analysis; Active learning; Deep active learning; Neural networks
DOI
10.1007/s10032-023-00429-8
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning has been extensively researched in the field of document analysis and has shown excellent performance across a wide range of document-related tasks. As a result, a great deal of emphasis is now being placed on its practical deployment and integration into modern industrial document processing pipelines. It is well known, however, that deep learning models are data-hungry and often require huge volumes of annotated data in order to achieve competitive performances. And since data annotation is a costly and labor-intensive process, it remains one of the major hurdles to their practical deployment. This study investigates the possibility of using active learning to reduce the costs of data annotation in the context of document image classification, which is one of the core components of modern document processing pipelines. The results of this study demonstrate that by utilizing active learning (AL), deep document classification models can achieve competitive performances to the models trained on fully annotated datasets and, in some cases, even surpass them by annotating only 15-40% of the total training dataset. Furthermore, this study demonstrates that modern AL strategies significantly outperform random querying, and in many cases achieve comparable performance to the models trained on fully annotated datasets even in the presence of practical deployment issues such as data imbalance, and annotation noise, and thus, offer tremendous benefits in real-world deployment of deep document classification models. The code to reproduce our experiments is publicly available at .
Pages: 187-209
Number of pages: 23