Optical Character Recognition for Medical Records Digitization with Deep Learning

被引:3
作者
Zaryab, Muhammad Ateeque [1 ]
Ng, Chuen Rue [1 ]
机构
[1] Tech Univ Ilmenau, Inst Biomed Engn, Ilmenau, Germany
来源
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023年
关键词
OCR; Deep Learning; Computer Vision; Text Recognition;
D O I
10.1109/ICIP49359.2023.10222038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The importance of document digitization has increased due to recent technological advancements, including in the medical field. Digitization of medical records plays a vital role in the healthcare sector as it helps expedite emergency treatment. Due to the scarcity of published studies and public German textual resources, a medical records database with German handwriting was collected and digitized. In this study, document digitization was accomplished by implementing deep learning, region of interest (ROI) detection, and optical character recognition (OCR) on a dataset containing medical forms filled with German and English characters. To find the best model for ROI detection, YOLOv5, and SSDResNet50 models were utilized and compared with YOLOv5 producing a better mean average precision (mAP) of 0.91. OCR was then carried out on the output from YOLOv5 with two different methods again for comparison. The Gated-CNN-BLSTM algorithm yielded a character error rate (CER) of 9%, while transformer-based OCR (TrOCR) achieved a CER of 6%. The proposed system could be implemented and further tested in local hospitals, with the OCR dictionary being expandable to include other Roman character-based languages.
引用
收藏
页码:3260 / 3263
页数:4
相关论文
共 10 条
[1]  
[Anonymous], 2016, Journal of Theoretical & Applied Information Technology
[2]   Handwritten Digit Recognition of MNIST dataset using Deep Learning state-of-the-art Artificial Neural Network (ANN) and Convolutional Neural Network (CNN) [J].
Beohar, Drishti ;
Rasool, Akhtar .
2021 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2021, :542-548
[3]   Gated Convolutional Recurrent Neural Networks for Multilingual Handwriting Recognition [J].
Bluche, Theodore ;
Messina, Ronaldo .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :646-651
[4]  
Murphy E, 2021, 2021 SYSTEMS AND INFORMATION ENGINEERING DESIGN SYMPOSIUM (IEEE SIEDS 2021), P23, DOI 10.1109/SIEDS52267.2021.9483723
[5]  
Nagarikar A., 2021, REV GEINTEC, V11, P4405
[6]   Are Multidimensional Recurrent Layers Really Necessary for Handwritten Text Recognition? [J].
Puigcerver, Joan .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :67-72
[7]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[8]  
Singh A., 2012, INT J MACH LEARN COM, V2, P314, DOI [10.7763/IJMLC.2012.V2.137, DOI 10.7763/IJMLC.2012.V2.137]
[9]  
Torrey L. A., 2009, 11 TRANSFER LEARNING
[10]   Secure and Efficient User Authentication Scheme Based on Password and Smart Card for Multiserver Environment [J].
Zhao, Yan ;
Li, Shiming ;
Jiang, Liehui .
SECURITY AND COMMUNICATION NETWORKS, 2018,