A Text Detection Algorithm for Image of Student Exercises Based on CTPN and Enhanced YOLOv3

被引：11

作者：

Cao, Langcai ^{[1
,2
]}

Li, Hongwei ^{[1
]}

Xie, Rongbiao ^{[1
]}

Zhu, Jinrong ^{[1
]}

机构：

[1] Xiamen Univ, Dept Automat, Xiamen 361005, Peoples R China

[2] Xiamen Key Lab Big Data Intelligent Anal & Decis, Xiamen 361005, Peoples R China

来源：

IEEE ACCESS | 2020年 / 8卷

基金：

中国国家自然科学基金;

关键词：

Text recognition; Databases; Splicing; Detectors; Feature extraction; Object detection; Detection algorithms; Text detection; exercise image; YOLOv3; CTPN; OCR platform; stitching algorithm;

D O I：

10.1109/ACCESS.2020.3025221

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Intelligent learning system (ILS) has become a popular learning tool for students. It can collect students' wrong questions in exercises and dig out their unskilled knowledge points so that it can recommend personalized exercises for students. Detecting text accurately from images of students' exercises is significant and essential in an ILS. However, a big challenge of text detection is that traditional text detection algorithms can not detect complete text lines in an exercise scene, and their detection box always splits between Chinese and mathematical symbols. In this article, we propose a deep-learning-based approach for text detection, which improves You Only Look Once version 3 (YOLOv3) by changing the regression object from a single character to a fixed-width text and applies a stitching strategy to construct text lines based on the relation matrix, which improves the accuracy by 9.8%. Experimental results on both RCTW Chinese text detection dataset and real exercise scenario show that our model can improve detection effectiveness. In addition, we compare our method with two state-of-the-art approaches in applications of exercise text detection, and discuss its capability and limitations. We have also provided a platform which has implemented the proposal for detecting text lines in students' daily homework or examination papers, which enhances user experience well.

引用

页码：176924 / 176934

页数：11

共 29 条

[1] Character Region Awareness for Text Detection [J].

Baek, Youngmin ;

Lee, Bado ;

Han, Dongyoon ;

Yun, Sangdoo ;

Lee, Hwalsuk .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9357-9366

[2] SE-IYOLOV3: An Accurate Small Scale Face Detector for Outdoor Security [J].

Deng, Zhenrong ;

Yang, Rui ;

Lan, Rushi ;

Liu, Zhenbing ;

Luo, Xiaonan .

MATHEMATICS, 2020, 8 (01)

[3]

Epshtein B, 2010, PROC CVPR IEEE, P2963, DOI 10.1109/CVPR.2010.5540041

[4] Fast R-CNN [J].

Girshick, Ross .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448

[5] Rich feature hierarchies for accurate object detection and semantic segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587

[6] RECOGNITION OF RAISED CHARACTERS FOR AUTOMATIC CLASSIFICATION OF RUBBER TIRES [J].

HAM, YK ;

KANG, MS ;

CHUNG, HK ;

PARK, RH ;

PARK, GT .

OPTICAL ENGINEERING, 1995, 34 (01) :102-109

[7]

He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]

[8] Learning non-maximum suppression [J].

Hosang, Jan ;

Benenson, Rodrigo ;

Schiele, Bernt .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6469-6477

[9] ImageNet Classification with Deep Convolutional Neural Networks [J].

Krizhevsky, Alex ;

Sutskever, Ilya ;

Hinton, Geoffrey E. .

COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90

[10] TextBoxes plus plus : A Single-Shot Oriented Scene Text Detector [J].

Liao, Minghui ;

Shi, Baoguang ;

Bai, Xiang .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) :3676-3690

← 1 2 3 →