A New Arbitrary-shaped Text Detection Network by Reinforcing Edge Features

被引:0
作者
Bai H.-X. [1 ]
Wang H.-R. [1 ]
机构
[1] School of Computer and Information Technology, Shanxi University, Taiyuan
来源
Zidonghua Xuebao/Acta Automatica Sinica | 2023年 / 49卷 / 05期
基金
中国国家自然科学基金;
关键词
arbitrary-shaped; edge region; progressive scale expansion network (PSENet); Scene text detection; shallow feature;
D O I
10.16383/j.aas.c220429
中图分类号
学科分类号
摘要
In the detection of scene texts areas, the text instances'edge features are processed in the same way as other features. Nevertheless, the accurate detection of adjacent text edges is crucial in the correct identification of arbitrary-shaped text regions in natural scenes. Obviously, the identification accuracy increases if edge features can be enhanced and modeled through independent branches in the network. To this end, three network modules are proposed to enhance the edge features in this paper. These modules are the shallow feature enhancement module which effectively enhances the shallow features with more edge features, the edge region detection module which decouples the original features into edge features and text features to explicitly model the edge features of the object, and the branch feature fusion module which effectively fuses these two types of features in the recognition process. After the proposed modules are added to the progressive scale expansion network (PSENet), the ablation experiments show that both the independent application and the synthetic application of these modules increase the prediction accuracy. In addition, the comparison experiments on three commonly used public datasets with ten state-of-the-art methods show that the improved edge-oriented feature reinforcing network (EFRNet) has higher F1-measure accuracy. © 2023 Science Press. All rights reserved.
引用
收藏
页码:1019 / 1030
页数:11
相关论文
共 45 条
[1]  
Lyu P Y, Liao M H, Yao C, Wu W H, Bai X., Mask TextSpotter: An end-to-end trainable neural network for spotting text with arbitrary shapes, Proceedings of the 15th European Conference on Computer Vision, pp. 71-88, (2018)
[2]  
He T, Huang W L, Qiao Y, Yao J., Text-attentional convolutional neural network for scene text detection, IEEE Transactions on Image Processing, 25, 6, pp. 2529-2541, (2016)
[3]  
Qin S Y, Manduchi R., Cascaded segmentation-detection networks for word-level text spotting, Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), pp. 1275-1282, (2017)
[4]  
Cho H, Sung M, Jun B., Canny text detector: Fast and robust scene text localization algorithm, Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3566-3573, (2016)
[5]  
Tian S X, Pan Y F, Huang C, Lu S J, Yu K, Tan C L., Text flow: A unified text detection system in natural scene images, Proceedings of the International Conference on Computer Vision (ICCV), pp. 4651-4659, (2015)
[6]  
Wang Run-Min, Sang Nong, Ding Ding, Chen Jie, Ye Qi-Xiang, Gao Chang-Xin, Et al., Text detection in natural scene image: A survey, Acta Automatica Sinica, 44, 12, pp. 2113-2141, (2018)
[7]  
Liu Y L, Jin L W., Deep matching prior network: Toward tighter multi-oriented text detection, Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3454-3461, (2017)
[8]  
Zhang Z, Zhang C Q, Shen W, Yao C, Liu W Y, Bai X., Multioriented text detection with fully convolutional networks, Proceedings of the International Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4159-4167, (2016)
[9]  
Zhong Z Y, Jin L W, Huang S P., DeepText: A new approach for text proposal generation and text detection in natural images, Proceedings of the International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1208-1212, (2017)
[10]  
Tian Z, Huang W L, He T, He P, Qiao Y., Detecting text in natural image with connectionist text proposal network, Proceedings of the 14th European Conference on Computer vision, pp. 56-72, (2016)