Irregular Scene Text Detection Based on a Graph Convolutional Network

被引:2
作者
Zhang, Shiyu [1 ,2 ]
Zhou, Caiying [1 ]
Li, Yonggang [2 ]
Zhang, Xianchao [3 ]
Ye, Lihua [2 ]
Wei, Yuanwang [2 ,3 ]
机构
[1] Jiangxi Univ Sci & Technol, Coll Sci, Ganzhou 341000, Peoples R China
[2] Jiaxing Univ, Coll Informat Sci & Engn, Jiaxing 314001, Peoples R China
[3] Jiaxing Univ, Key Lab Med Elect & Digital Hlth Zhejiang Prov, Jiaxing 314001, Peoples R China
关键词
text detection; scene image; irregular; relation inference; GCN;
D O I
10.3390/s23031070
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Detecting irregular or arbitrary shape text in natural scene images is a challenging task that has recently attracted considerable attention from research communities. However, limited by the CNN receptive field, these methods cannot directly capture relations between distant component regions by local convolutional operators. In this paper, we propose a novel method that can effectively and robustly detect irregular text in natural scene images. First, we employ a fully convolutional network architecture based on VGG16_BN to generate text components via the estimated character center points, which can ensure a high text component detection recall rate and fewer noncharacter text components. Second, text line grouping is treated as a problem of inferring the adjacency relations of text components with a graph convolution network (GCN). Finally, to evaluate our algorithm, we compare it with other existing algorithms by performing experiments on three public datasets: ICDAR2013, CTW-1500 and MSRA-TD500. The results show that the proposed method handles irregular scene text well and that it achieves promising results on these three public datasets.
引用
收藏
页数:17
相关论文
共 39 条
  • [11] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [12] Li X, 2018, ARXIV
  • [13] Liao MH, 2016, Arxiv, DOI arXiv:1611.06779
  • [14] Real-Time Scene Text Detection With Differentiable Binarization and Adaptive Scale Fusion
    Liao, Minghui
    Zou, Zhisheng
    Wan, Zhaoyi
    Yao, Cong
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (01) : 919 - 931
  • [15] Long S., 2018, P EUROPEAN C COMPUTE
  • [16] Scene Text Detection and Recognition: The Deep Learning Era
    Long, Shangbang
    He, Xin
    Yao, Cong
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (01) : 161 - 184
  • [17] Transformer-based Text Detection in the Wild
    Raisi, Zobeir
    Naiel, Mohamed A.
    Younes, Georges
    Wardell, Steven
    Zelek, John S.
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, : 3156 - 3165
  • [18] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [19] Shi-Xue Zhang, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P9696, DOI 10.1109/CVPR42600.2020.00972
  • [20] Training Region-based Object Detectors with Online Hard Example Mining
    Shrivastava, Abhinav
    Gupta, Abhinav
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 761 - 769