BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK

被引:6
作者
Yang, Chuang
Chen, Mulin
Yuan, Yuan
Wang, Qi [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
基金
中国国家自然科学基金;
关键词
Arbitrary-shaped text detection; scene text detection; real-time text detector; computer vision;
D O I
10.1109/ICASSP43922.2022.9747331
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detecting irregular-shaped text instances is the main challenge for text detection. Existing approaches can be roughly divided into top-down and bottom-up perspective methods. The former encodes text contours into unified units, which always fails to fit highly curved text contours. The latter represents text instances by a number of local units, where the complicated network and post-processing lead to slow detection speed. In this paper, to detect arbitrary-shaped text instances with high detection accuracy and speed simultaneously, we propose a Bidirectional Perspective strategy based Network (BiP-Net). Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively. Moreover, a contour connecting (CC) algorithm is proposed to avoid the information loss of text contours by rebuilding interval contours from a bottom-up perspective. The experimental results on MSRA-TD500, CTW1500, and ICDAR2015 datasets demonstrate the superiority of BiP-Net against several state-of-the-art methods.
引用
收藏
页码:2255 / 2259
页数:5
相关论文
共 27 条
  • [1] Character Region Awareness for Text Detection
    Baek, Youngmin
    Lee, Bado
    Han, Dongyoon
    Yun, Sangdoo
    Lee, Hwalsuk
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9357 - 9366
  • [2] Cao M, 2020, INT CONF ACOUST SPEE, P2228, DOI [10.1109/ICASSP40776.2020.9053679, 10.1109/icassp40776.2020.9053679]
  • [3] External Attention Based TransUNet and Label Expansion Strategy for Crack Detection
    Fang, Jie
    Yang, Chen
    Shi, Yuetian
    Wang, Nan
    Zhao, Yang
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 19054 - 19063
  • [4] TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
    Feng, Wei
    He, Wenhao
    Yin, Fei
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9075 - 9084
  • [5] Synthetic Data for Text Localisation in Natural Images
    Gupta, Ankush
    Vedaldi, Andrea
    Zisserman, Andrew
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2315 - 2324
  • [6] Karatzas D, 2015, PROC INT CONF DOC, P1156, DOI 10.1109/ICDAR.2015.7333942
  • [7] CornerNet: Detecting Objects as Paired Keypoints
    Law, Hei
    Deng, Jia
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 765 - 781
  • [8] Li XL, 2017, AAAI CONF ARTIF INTE, P4147
  • [9] Rotation-sensitive Regression for Oriented Scene Text Detection
    Liao, Minghui
    Zhu, Zhen
    Shi, Baoguang
    Xia, Gui-song
    Bai, Xiang
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5909 - 5918
  • [10] Liu Y. L., 2017, CoRR abs/1712.02170