A Text Detection Algorithm for Image of Student Exercises Based on CTPN and Enhanced YOLOv3

被引:11
作者
Cao, Langcai [1 ,2 ]
Li, Hongwei [1 ]
Xie, Rongbiao [1 ]
Zhu, Jinrong [1 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China
[2] Xiamen Key Lab Big Data Intelligent Anal & Decis, Xiamen 361005, Peoples R China
基金
中国国家自然科学基金;
关键词
Text recognition; Databases; Splicing; Detectors; Feature extraction; Object detection; Detection algorithms; Text detection; exercise image; YOLOv3; CTPN; OCR platform; stitching algorithm;
D O I
10.1109/ACCESS.2020.3025221
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Intelligent learning system (ILS) has become a popular learning tool for students. It can collect students' wrong questions in exercises and dig out their unskilled knowledge points so that it can recommend personalized exercises for students. Detecting text accurately from images of students' exercises is significant and essential in an ILS. However, a big challenge of text detection is that traditional text detection algorithms can not detect complete text lines in an exercise scene, and their detection box always splits between Chinese and mathematical symbols. In this article, we propose a deep-learning-based approach for text detection, which improves You Only Look Once version 3 (YOLOv3) by changing the regression object from a single character to a fixed-width text and applies a stitching strategy to construct text lines based on the relation matrix, which improves the accuracy by 9.8%. Experimental results on both RCTW Chinese text detection dataset and real exercise scenario show that our model can improve detection effectiveness. In addition, we compare our method with two state-of-the-art approaches in applications of exercise text detection, and discuss its capability and limitations. We have also provided a platform which has implemented the proposal for detecting text lines in students' daily homework or examination papers, which enhances user experience well.
引用
收藏
页码:176924 / 176934
页数:11
相关论文
共 29 条
[21]   YOLO9000: Better, Faster, Stronger [J].
Redmon, Joseph ;
Farhadi, Ali .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6517-6525
[22]   You Only Look Once: Unified, Real-Time Object Detection [J].
Redmon, Joseph ;
Divvala, Santosh ;
Girshick, Ross ;
Farhadi, Ali .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :779-788
[23]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[24]  
Roy Partha Pratim, 2009, 2009 10th International Conference on Document Analysis and Recognition (ICDAR), P11, DOI 10.1109/ICDAR.2009.124
[25]   An overview of the tesseract OCR engine [J].
Smith, Ray .
ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, :629-633
[26]   Detecting Text in Natural Image with Connectionist Text Proposal Network [J].
Tian, Zhi ;
Huang, Weilin ;
He, Tong ;
He, Pan ;
Qiao, Yu .
COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 :56-72
[27]   Shape Robust Text Detection with Progressive Scale Expansion Network [J].
Wang, Wenhai ;
Xie, Enze ;
Li, Xiang ;
Hou, Wenbo ;
Lu, Tong ;
Yu, Gang ;
Shao, Shuai .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9328-9337
[28]   Multi-Oriented Text Detection with Fully Convolutional Networks [J].
Zhang, Zheng ;
Zhang, Chengquan ;
Shen, Wei ;
Yao, Cong ;
Liu, Wenyu ;
Bai, Xiang .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4159-4167
[29]   Design and Analysis of Refined Inspection of Field Conditions of Oilfield Pumping Wells Based on Rotorcraft UAV Technology [J].
Zhou, Yu ;
Wu, Chunxue ;
Wu, Qunhui ;
Eli, Zelda Makati ;
Xiong, Naixue ;
Zhang, Sheng .
ELECTRONICS, 2019, 8 (12)