A Direct Regression Scene Text Detector With Position-Sensitive Segmentation

被引:20
作者
Cheng, Peirui [1 ]
Cai, Yuanqiang [1 ]
Wang, Weiqiang [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China
关键词
Scene text detection; fully convolutional network; direct regression; position-sensitive segmentation;
D O I
10.1109/TCSVT.2019.2947475
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Direct regression methods have demonstrated their success on various multi-oriented benchmarks for scene text detection due to the high recall rate for small targets and the direct regression for text boxes. However, too many false positive candidates and inaccurate position regression still limit the performance of these methods. In this paper, we propose an end-to-end method by introducing position-sensitive segmentation into the direct regression method to overcome these shortcomings. We generate the ground truth of position-sensitive segmentation maps based on the information of text boxes so that the position-sensitive segmentation module can be trained synchronously with the direct regression module. Besides, more information about the relative position of text is provided for the network through the training of position-sensitive segmentation maps, which improves the expressiveness of the network. We also introduce spatial pyramid of position-sensitive segmentation into the proposed method considering the huge differences in sizes and aspect ratios of scene texts and we propose position-sensitive COI(Corner area of Interest) pooling into the proposed method to speed up the inference. Experiments on datasets ICDAR2015, MLT-17 and COCO-Text demonstrate that the proposed method has a comparable performance with state-of-the-art methods while it is more efficient. We also provide abundant ablation experiments to demonstrate the effectiveness of these improvements in our proposed method.
引用
收藏
页码:4171 / 4181
页数:11
相关论文
共 29 条
  • [21] Multi-orientation scene text detection with scale-guided regression
    Liang, Min
    Hou, Jie-Bo
    Zhu, Xiaobin
    Yang, Chun
    Qin, Jingyan
    Yin, Xu-Cheng
    NEUROCOMPUTING, 2021, 461 : 310 - 318
  • [22] Arbitrary shape scene text detector with accurate text instance generation based on instance-relevant contexts
    Haiyan Li
    Yangsong Zhang
    Bayram Bayramli
    Hongtao Lu
    Multimedia Tools and Applications, 2023, 82 : 17827 - 17852
  • [23] Arbitrary shape scene text detector with accurate text instance generation based on instance-relevant contexts
    Li, Haiyan
    Zhang, Yangsong
    Bayramli, Bayram
    Lu, Hongtao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) : 17827 - 17852
  • [24] GISCA: Gradient-Inductive Segmentation Network With Contextual Attention for Scene Text Detection
    Cao, Meng
    Zou, Yuexian
    Yang, Dongming
    Liu, Chao
    IEEE ACCESS, 2019, 7 : 62805 - 62816
  • [25] Multiorientation/multiscript scene text detection based on projection profile analysis and graph segmentation
    Koo, Hyung I. I.
    JOURNAL OF ELECTRONIC IMAGING, 2016, 25 (06)
  • [26] Boundary-Aware Arbitrary-Shaped Scene Text Detector With Learnable Embedding Network
    Xing, Mengting
    Xie, Hongtao
    Tan, Qingfeng
    Fang, Shancheng
    Wang, Yuxin
    Zha, Zhengjun
    Zhang, Yongdong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 3129 - 3143
  • [27] GDText-VM: an arbitrary-shaped scene text detector based on globally deformable VMamba
    Yingnan Zhao
    Zheng Hu
    Fangqi Ding
    Jielin Jiang
    Xiaolong Xu
    Complex & Intelligent Systems, 2025, 11 (8)
  • [28] PCBSNet: A Pure Convolutional Bilateral Segmentation Network for Real-Time Natural Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Zhi, Min
    Xu, Qiaozhi
    ELECTRONICS, 2023, 12 (14)
  • [29] Natural scene text detection with multi-layer segmentation and higher order conditional random field based analysis
    Wang, Xiaobing
    Song, Yonghong
    Zhang, Yuanlin
    Xin, Jingmin
    PATTERN RECOGNITION LETTERS, 2015, 60-61 : 41 - 47