Supervised Attention Network for Arbitrary-Shaped Text Detection in Edge-Fainted Noisy Scene Images

Cited by: 3
Authors
Soni, Aishwarya [1]
Dutta, Tanima [1]
Nigam, Nitika [1]
Verma, Deepali [1]
Gupta, Hari Prabhat [1]
Affiliations
[1] IIT BHU, Dept Comp Sci & Engn, Varanasi 221005, Uttar Pradesh, India
Keywords
Image edge detection; Semantics; Feature extraction; Noise measurement; Detectors; Task analysis; Lighting; Deep neural networks; fainted edges; multiattention networks; scene text detection
DOI
10.1109/TCSS.2022.3153557
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Text mining in noisy situations, such as poor contrast and faint edges, is one of the challenging areas of research in the domain of social networks and computer vision. Scene text detection is a complicated task because text regions vary widely in size, orientation, aspect ratio, color, font, and script. Furthermore, the contrast of a scene image varies drastically in noisy situations due to poor illumination and image filtering. This makes the text edges faint and the task of detection more challenging. In this article, we propose a semantic edge supervised spatial-channel attention network, known as SESANet, for detecting arbitrary-shaped text instances in noisy scene images with faint text edges. Our network learns multiscale (MS) supervised edge semantics, pixel-wise spatial structure information, and interchannel dependencies to precisely localize the text masks in scene images with poor contrast and illumination. Our network is efficient, precise, and fast. SESANet captures rich, dense, discriminative, and MS semantic information. Experimental results on the publicly available benchmark datasets show that the proposed network achieves superior recall. A new dataset of scene images, named the Edge-fainted Noisy Arbitrary-shaped Scene Text (EFNAST) dataset, with varying noise density, poor contrast, low illumination, and faint edges, is also created.
Pages: 1179-1188
Number of pages: 10