Handwriting Recognition for Medical Prescriptions using a CNN-Bi-LSTM Model

被引:3
作者
Jain, Tavish [1 ]
Sharma, Rohan [1 ]
Malhotra, Ruchika [1 ]
机构
[1] Delhi Technol Univ, Dept Comp Sci & Engn, New Delhi, India
来源
2021 6TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT) | 2021年
关键词
Long-short term memory networks; convolutional networks; neural networks; connectionist temporal classification; recurrent neural networks; character error rate; batch normalization; Seq2Seq networks; Adam Optimizer; PyTorch;
D O I
10.1109/I2CT51068.2021.9418153
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
It is commonly seen that it is tough to read the handwritten text from medical prescriptions. It is mostly due to the different style of handwriting and the use of Latin abbreviations for medical terms which is usually unknown to the general public. This can make it difficult for both patients and even pharmacists to read the prescription, which can have negative or even fatal consequences if read incorrectly. This paper demonstrates the use of a CNN-Bi-LSTM model along with Connectionist Temporal Classification. The prescribed model consists of three components, the convolutional layers for feature extraction, the Bi-LSTM network for making predictions for each frame of the context vector and the final decoding to translate each character in the recognized sequence by LSTM layers into an alphabetic character using the CTC loss function. A linear layer is added after the bi-LSTM layer to compute the final probabilities, which will be decoded. We also built a corpus manually containing the terms widely used in the medical domain, commonly used in prescriptions. We then use string matching algorithms, and string distance functions to find the nearest word in the corpus, so that bias is given to medical terms for increasing accuracy of the predicted output.
引用
收藏
页数:4
相关论文
共 11 条
[1]  
[Anonymous], 2009 INT C ART INT P
[2]   Optical character recognition for cursive handwriting [J].
Arica, N ;
Yarman-Vural, FT .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (06) :801-813
[3]  
Bala Abhishek, 2016, 6 INT C ADV COMP COM
[4]  
Basu Subhadip, 2009, HIERARCHICAL APPROAC
[5]  
Chumuang N., 2018, 2018 INT JOINT S ART, P1, DOI [10.1109/iSAINLP.2018.8692958, DOI 10.1109/ISAINLP.2018.8692958]
[6]  
Dave NJIjosp image processing recognition p, 2015, INT J SIGNAL PROCESS, V8, P155
[7]  
Dhande P. S., 2017, INT C COMP COMM CONT, P1, DOI 10.1109/ICCUBEA.2017.8463842
[8]  
Kamalanaban E., 2018, Inter. J. Eng. Techno. (UAE), V7, P114, DOI [10.14419/ijet.v7i3.34.18785, DOI 10.14419/IJET.V7I3.34.18785]
[9]  
Keisham Kanchan, 2016, INFORM SYSTEMS DESIG
[10]   The IAM-database: An English sentence database for offline handwriting recognition [J].
U.-V. Marti ;
H. Bunke .
International Journal on Document Analysis and Recognition, 2002, 5 (1) :39-46