OCR for Drawing Images Using Bidirectional LSTM with CTC

被引:0
作者
Shin, Hee-Ran [1 ]
Park, Jang-Sik [1 ]
Song, Jong-Kwan [1 ]
机构
[1] Kyungsung Univ, Elect Engn, Busan, South Korea
来源
2019 IEEE STUDENT CONFERENCE ON ELECTRIC MACHINES AND SYSTEMS (SCEMS 2019) | 2019年
关键词
Deep learning; Convolutional Neural Network; Long short-term memory; CTC; OCR; Mechanical drawing;
D O I
10.1109/scems201947376.2019.8972628
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Nowadays, the system is changing with the Optical Character Recognition (OCR) technology in various industrial. It is technology that allows computers detect and convert form handwritten or scanned images into searchable machine encoded data. Searchable data can save a huge amount of time and effort. Therefore, the machine industry has also improved portability and accessibility by converting existing dealer customers' mechanical parts drawing books into mobile. Dealers can easily order mechanical parts without looking for drawing, just search a serial number of mechanical parts on the database. In this paper, we propose the OCR on the drawing images to convert images into searchable data. Proposed OCR consist of three parts: pre-processing, deep learning and post-processing. The pre-processing part removed the guide lines and shape for improving the precision. The guide lines and shape usually cause false recognition as '1' or '7'. The deep learning part extract features and classify the images. We adopt Shi, et al.'s CRNN architecture [1]. The post-processing, drop the low probability of object. Most of left lines and shapes are seem to the low probability. As result of proposed OCR, it shows 98.52% of average recall rate and 92.25% average precision.
引用
收藏
页数:4
相关论文
共 7 条
[1]  
[Anonymous], P 3 INT C LEARNING R
[2]  
Graves A., 2006, P 23 INT C MACHINE L, P369
[3]  
Graves A, 2013, INT CONF ACOUST SPEE, P6645, DOI 10.1109/ICASSP.2013.6638947
[4]   A Novel Connectionist System for Unconstrained Handwriting Recognition [J].
Graves, Alex ;
Liwicki, Marcus ;
Fernandez, Santiago ;
Bertolami, Roman ;
Bunke, Horst ;
Schmidhuber, Juergen .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (05) :855-868
[5]  
Hochreiter S, 1997, Neural Computation, V9, P1735
[6]   Connectionist Temporal Modeling for Weakly Supervised Action Labeling [J].
Huang, De-An ;
Li Fei-Fei ;
Niebles, Juan Carlos .
COMPUTER VISION - ECCV 2016, PT IV, 2016, 9908 :137-153
[7]   An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition [J].
Shi, Baoguang ;
Bai, Xiang ;
Yao, Cong .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) :2298-2304