Automated Identity Document Classification

被引:0
作者
Bhatlawande, Shripad [1 ]
Shilaskar, Swati [1 ]
Gupta, Divyam [1 ]
Dupare, Prashik [1 ]
Ghode, Rutvik [1 ]
机构
[1] Vishwakarma Inst Technol, Dept Elect & Telecommun, Pune, Maharashtra, India
来源
COMMUNICATION AND INTELLIGENT SYSTEMS, VOL 3, ICCIS 2023 | 2024年 / 969卷
关键词
Dataset; Image Processing; Model Training; Model Testing; Classification; Computer Vision; Automation; Documents;
D O I
10.1007/978-981-97-2082-8_30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A model to classify documents into their categories and use them as per needs as in all the industrial documentation is needed to keep track of their employees documents and it is a frequent happening where all the documents get mixed up into each other and result in a lot of trouble and delay and leads to incurring a cost to the organization; the model presented in this paper offers a solution to this issue. A novel algorithm is implemented to tune the hyperparameter using a CNN Model as its basis using hence, created parameters to classify documents into their categories and making it easier to maintain we achieved an 81% average accuracy score which might be because of different countries belonging the datasets which have different structures as well as we have used one hot encoding which gives it a nice scope to be directly used to scale up and use in larger scale scenarios. Further, we implemented different machine learning models which provided us with a reliable accuracy of 94 percent which also proves that in a few cases, machine learning can outperform deep learning. Also it further gives us a great implacable model for real-life use cases.
引用
收藏
页码:431 / 446
页数:16
相关论文
共 20 条
[1]  
Admuthe SS, 2021, Int J Eng Res Technol (IJERT), V10
[2]  
Afzal MZ, 2015, PROC INT CONF DOC, P1111, DOI 10.1109/ICDAR.2015.7333933
[3]   An Empirical Comparison of Machine Learning Models for Time Series Forecasting [J].
Ahmed, Nesreen K. ;
Atiya, Amir F. ;
El Gayar, Neamat ;
El-Shishiny, Hisham .
ECONOMETRIC REVIEWS, 2010, 29 (5-6) :594-621
[4]   Binarization of Degraded Document Images Using Convolutional Neural Networks and Wavelet-Based Multichannel Images [J].
Akbari, Younes ;
Al-Maadeed, Somaya ;
Adam, Kalthoum .
IEEE ACCESS, 2020, 8 :153517-153534
[5]  
[Anonymous], 2022, IEEE Trans Eng Manage
[6]  
Das A, 2018, INT C PATT RECOG, P3180, DOI 10.1109/ICPR.2018.8545630
[7]   Comparison of Machine Learning Algorithms for Retrieval of Water Quality Indicators in Case-II Waters: A Case Study of Hong Kong [J].
Hafeez, Sidrah ;
Wong, Man Sing ;
Ho, Hung Chak ;
Nazeer, Majid ;
Nichol, Janet ;
Abbas, Sawaid ;
Tang, Danling ;
Lee, Kwon Ho ;
Pun, Lilian .
REMOTE SENSING, 2019, 11 (06)
[8]   Document images classification based on deep learning [J].
Hu, Biao ;
Ergu, Daji ;
Yang, Huan ;
Liu, Kuiyi ;
Cai, Ying .
7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT (ITQM 2019): INFORMATION TECHNOLOGY AND QUANTITATIVE MANAGEMENT BASED ON ARTIFICIAL INTELLIGENCE, 2019, 162 :514-522
[9]   LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking [J].
Huang, Yupan ;
Lv, Tengchao ;
Cui, Lei ;
Lu, Yutong ;
Wei, Furu .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :4083-4091
[10]  
Klosowski P, 2018, SIG P ALGO ARCH ARR, P223, DOI 10.23919/SPA.2018.8563389