A deep learning-based solution for digitization of invoice images with automatic invoice generation and labelling

被引:1
作者
Arslan, Halil [1 ]
Isik, Yunus Emre [2 ]
Gormez, Yasin [2 ]
机构
[1] Sivas Cumhuriyet Univ, Dept Comp Engn, TR-58140 Sivas, Turkiye
[2] Sivas Cumhuriyet Univ, Dept Management Informat Syst, TR-58140 Sivas, Turkiye
关键词
Invoice processing; Digitalization; Object detection; Automatic generation; Automatic labelling;
D O I
10.1007/s10032-023-00449-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, the level of invoice traffic between companies has reached enormous levels. Invoices are crucial financial documents for companies, and they need to extract this information from these documents to access and control them quickly when necessary. While electronic invoices can be easily transferred to the company's ERP system with the help of integrators, information from printed invoices must be entered into the ERP system. Information entry is generally performed manually by company employees, so the probability of error is high. The automatic recognition of information in printed invoices will reduce the possibility of error. It will also save time and money by reducing workforce requirements. This study proposes a deep learning-based solution for detecting fields in image invoices that are in high demand among businesses. The system offers an end-to-end solution, which includes a novel method for generating synthetic invoices and automatic labeling. Three invoice templates were used to evaluate the usability of the system and an adaptive fine-tuning-based solution is proposed for newly coming invoice templates. Furthermore, 6 different object detection models were compared to find the most suitable one for our problem. The system was also tested with 1022 real invoice images that were manually labeled to test real-world usage. The results indicated that the fine-tuned model achieved an accuracy that was 8.4% higher than the baseline models. In tests performed on CPU, TOOD and Cascade-RCNN models were the most successful algorithms, while YOLOv5 was the fastest running algorithm. Depending on the priority of the needs, both algorithms can be preferred for real-time usage in the detection of invoice fields. The synthetic invoice generation code is available at https://github.com/SCU-CENG/Invoice-Generation.
引用
收藏
页码:97 / 109
页数:13
相关论文
共 34 条
[1]  
Adhikari B, 2018, EUR W VIS INF PROCES
[2]   End to End Invoice Processing Application Based on Key Fields Extraction [J].
Arslan, Halil .
IEEE ACCESS, 2022, 10 :78398-78413
[3]   Multi-Layout Unstructured Invoice Documents Dataset: A Dataset for Template-Free Invoice Processing and Its Evaluation Using AI Approaches [J].
Baviskar, Dipali ;
Ahirrao, Swati ;
Kotecha, Ketan .
IEEE ACCESS, 2021, 9 :101494-101512
[4]   Multi-Layout Invoice Document Dataset (MIDD): A Dataset for Named Entity Recognition [J].
Baviskar, Dipali ;
Ahirrao, Swati ;
Kotecha, Ketan .
DATA, 2021, 6 (07)
[5]  
Bochkovskiy A, 2020, PREPRINT, DOI 10.48550/ARXIV.2004.10934
[6]   Albumentations: Fast and Flexible Image Augmentations [J].
Buslaev, Alexander ;
Iglovikov, Vladimir I. ;
Khvedchenya, Eugene ;
Parinov, Alex ;
Druzhinin, Mikhail ;
Kalinin, Alexandr A. .
INFORMATION, 2020, 11 (02)
[7]   Cascade R-CNN: High Quality Object Detection and Instance Segmentation [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (05) :1483-1498
[8]   Annotating Object Instances with a Polygon-RNN [J].
Castrejon, Lluis ;
Kundu, Kaustav ;
Urtasun, Raquel ;
Fidler, Sanja .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4485-4493
[9]   Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN plus [J].
Acuna, David ;
Ling, Huan ;
Kar, Amlan ;
Fidler, Sanja .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :859-868
[10]   Automatic image annotation for fluorescent cell nuclei segmentation [J].
Englbrecht, Fabian ;
Ruider, Iris E. ;
Bausch, Andreas R. .
PLOS ONE, 2021, 16 (04)