BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK

Citations: 6
Authors
Yang, Chuang
Chen, Mulin
Yuan, Yuan
Wang, Qi [1]
Affiliations
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Funding
National Natural Science Foundation of China
Keywords
Arbitrary-shaped text detection; scene text detection; real-time text detector; computer vision;
DOI
10.1109/ICASSP43922.2022.9747331
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
Detecting irregular-shaped text instances is the main challenge for text detection. Existing approaches can be roughly divided into top-down and bottom-up perspective methods. The former encodes text contours into unified units, which always fails to fit highly curved text contours. The latter represents text instances by a number of local units, where the complicated network and post-processing lead to slow detection speed. In this paper, to detect arbitrary-shaped text instances with high detection accuracy and speed simultaneously, we propose a Bidirectional Perspective strategy based Network (BiP-Net). Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively. Moreover, a contour connecting (CC) algorithm is proposed to avoid the information loss of text contours by rebuilding interval contours from a bottom-up perspective. The experimental results on MSRA-TD500, CTW1500, and ICDAR2015 datasets demonstrate the superiority of BiP-Net against several state-of-the-art methods.
Pages: 2255-2259
Page count: 5