End to End Invoice Processing Application Based on Key Fields Extraction

Cited by: 4

Authors:

Arslan, Halil [1]

Affiliations:

[1] Sivas Cumhuriyet Univ, Engn Fac, Dept Comp Engn, Sivas, Turkey

Source:

IEEE ACCESS | 2022 / Vol. 10

Keywords:

Optical character recognition software; Data mining; Business; Image processing; Deep learning; Companies; Character recognition; Invoice processing; key fields extraction; text detection; deep learning; table extraction; optical character recognition; TABLE DETECTION; DOCUMENT;

DOI:

10.1109/ACCESS.2022.3192828

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

This paper proposes an automatic invoice processing system, a capability in great demand among private and public companies. The system supports all invoice file types that companies can submit. Companies submit invoices through a web interface or by email, and all submitted invoices are queued and processed sequentially. If the invoice is a text file, the invoice information is extracted from the text using template matching. If the invoice is an image, the text and table areas are detected and extracted. For table detection, we used both an image-processing-based method and a YOLOv5-based deep learning method. Cell extraction was then performed on the extracted table images. As a result of these processes, all text and table cells were obtained as images, and these images were converted into machine-readable text using the open-source software Tesseract OCR. Tesseract already provides trained models for English and Turkish. However, these models do not give satisfactory results for the Turkish invoices submitted by companies. Therefore, a new fine-tuned model trained on Turkish invoices was used for OCR. The experimental results showed that the fine-tuned Turkish model was more accurate than the Turkish and English models provided by Tesseract. In addition, the YOLOv5-based table detection model was more accurate than the image-processing-based table detection method.
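The queue-and-dispatch flow described in the abstract can be illustrated with a minimal Python sketch. It is not the paper's implementation: the `Invoice` class, the `TEMPLATE` patterns, the field names, and the `image_handler` hook are all hypothetical, and the image branch (table detection, cell extraction, OCR) is left as a pluggable callable rather than implemented.

```python
import re
from collections import deque
from dataclasses import dataclass

# Hypothetical template: field name -> regex with one capture group.
# A real deployment would likely keep one template per invoice layout.
TEMPLATE = {
    "invoice_no": re.compile(r"Invoice No:\s*(\S+)"),
    "date": re.compile(r"Date:\s*(\d{4}-\d{2}-\d{2})"),
    "total": re.compile(r"Total:\s*([\d.,]+)"),
}


@dataclass
class Invoice:
    name: str
    kind: str     # "text" or "image" (assumed labels)
    payload: str  # text content, or a file path for images


def extract_from_text(text):
    """Template matching: apply each pattern and keep the captured value."""
    fields = {}
    for key, pattern in TEMPLATE.items():
        match = pattern.search(text)
        fields[key] = match.group(1) if match else None
    return fields


def process_queue(queue, image_handler=None):
    """Process invoices one at a time, in submission order."""
    results = []
    while queue:
        invoice = queue.popleft()
        if invoice.kind == "text":
            results.append((invoice.name, extract_from_text(invoice.payload)))
        elif image_handler is not None:
            # Stand-in for table detection, cell extraction, and OCR.
            results.append((invoice.name, image_handler(invoice.payload)))
        else:
            results.append((invoice.name, None))
    return results


queue = deque([
    Invoice("a.txt", "text", "Invoice No: INV-001\nDate: 2022-07-01\nTotal: 1.250,00"),
    Invoice("b.png", "image", "b.png"),
])
results = process_queue(queue)
```

Passing the image branch in as a callable keeps the sketch runnable without any OCR or detection dependency; in practice that callable would wrap whatever detection and recognition stack is in use.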


Pages: 78398-78413

Number of pages: 16
