The Industrial Application of Artificial Intelligence-Based Optical Character Recognition in Modern Manufacturing Innovations

被引:4
作者
Tang, Qing [1 ]
Lee, Youngseok [1 ]
Jung, Hail [2 ]
机构
[1] INTERX, Data Sci Grp, Ulsan 44542, South Korea
[2] Seoul Natl Univ Sci & Technol, Dept Business Adm, Seoul 01811, South Korea
关键词
artificial intelligence; optical character recognition; manufacturing innovation; manufacturing industrial; real-world application; SYSTEM;
D O I
10.3390/su16052161
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper presents the development of a comprehensive, on-site industrial Optical Character Recognition (OCR) system tailored for reading text on iron plates. Initially, the system utilizes a text region detection network to identify the text area, enabling camera adjustments along the x and y axes and zoom enhancements for clearer text imagery. Subsequently, the detected text region undergoes line-by-line division through a text segmentation network. Each line is then transformed into rectangular patches for character recognition by the text recognition network, comprising a vision-based text recognition model and a language network. The vision network performs preliminary recognition, followed by refinement through the language model. The OCR results are then converted into digital characters and recorded in the iron plate registration system. This paper's contributions are threefold: (1) the design of a comprehensive, on-site industrial OCR system for autonomous registration of iron plates; (2) the development of a realistic synthetic image generation strategy and a robust data augmentation strategy to address data scarcity; and (3) demonstrated impressive experimental results, indicating potential for on-site industrial applications. The designed autonomous system enhances iron plate registration efficiency and significantly reduces factory time and labor costs.
引用
收藏
页数:20
相关论文
共 49 条
[41]   Shape Robust Text Detection with Progressive Scale Expansion Network [J].
Wang, Wenhai ;
Xie, Enze ;
Li, Xiang ;
Hou, Wenbo ;
Lu, Tong ;
Yu, Gang ;
Shao, Shuai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9328-9337
[42]   Intelligent Micron Optical Character Recognition of DFB Chip Using Deep Convolutional Neural Network [J].
Wang, Xudong ;
Li, Yebin ;
Liu, Juanxiu ;
Zhang, Jing ;
Du, Xiaohui ;
Liu, Lin ;
Liu, Yong .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[43]   A Review of Further Directions for Artificial Intelligence, Machine Learning, and Deep Learning in Smart Logistics [J].
Woschank, Manuel ;
Rauch, Erwin ;
Zsifkovits, Helmut .
SUSTAINABILITY, 2020, 12 (09)
[44]  
Xue CH, 2018, Arxiv, DOI arXiv:1807.03547
[45]   Improved local binary pattern for real scene optical character recognition [J].
Yang, Chu-Sing ;
Yang, Yung-Hsuan .
PATTERN RECOGNITION LETTERS, 2017, 100 :14-21
[46]  
Zhang C, 2023, IEEE T INSTRUM MEAS, V72, DOI [10.1109/TIM.2023.3315392, 10.1109/TIM.2023.3300474]
[47]   Hyperspectral pathology image classification using dimension-driven multi-path attention residual network [J].
Zhang, Xueyu ;
Li, Wei ;
Gao, Chenzhong ;
Yang, Yue ;
Chang, Kan .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 230
[48]  
Zhang Z., 2016, P IEEE C COMPUTER VI
[49]   Fourier Contour Embedding for Arbitrary-Shaped Text Detection [J].
Zhu, Yiqin ;
Chen, Jianyong ;
Liang, Lingyu ;
Kuang, Zhanghui ;
Jin, Lianwen ;
Zhang, Wayne .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3122-3130