BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK

被引:6
作者
Yang, Chuang
Chen, Mulin
Yuan, Yuan
Wang, Qi [1 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年
基金
中国国家自然科学基金;
关键词
Arbitrary-shaped text detection; scene text detection; real-time text detector; computer vision;
D O I
10.1109/ICASSP43922.2022.9747331
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Detecting irregular-shaped text instances is the main challenge for text detection. Existing approaches can be roughly divided into top-down and bottom-up perspective methods. The former encodes text contours into unified units, which always fails to fit highly curved text contours. The latter represents text instances by a number of local units, where the complicated network and post-processing lead to slow detection speed. In this paper, to detect arbitrary-shaped text instances with high detection accuracy and speed simultaneously, we propose a Bidirectional Perspective strategy based Network (BiP-Net). Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively. Moreover, a contour connecting (CC) algorithm is proposed to avoid the information loss of text contours by rebuilding interval contours from a bottom-up perspective. The experimental results on MSRA-TD500, CTW1500, and ICDAR2015 datasets demonstrate the superiority of BiP-Net against several state-of-the-art methods.
引用
收藏
页码:2255 / 2259
页数:5
相关论文
共 27 条
  • [11] TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes
    Long, Shangbang
    Ruan, Jiaqiang
    Zhang, Wenjie
    He, Xin
    Wu, Wenhao
    Yao, Cong
    [J]. COMPUTER VISION - ECCV 2018, PT II, 2018, 11206 : 19 - 35
  • [12] SegLink plus plus : Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping
    Tang, Jun
    Yang, Zhibo
    Wang, Yongpan
    Zheng, Qi
    Xu, Yongchao
    Bai, Xiang
    [J]. PATTERN RECOGNITION, 2019, 96
  • [13] Learning Shape-Aware Embedding for Scene Text Detection
    Tian, Zhuotao
    Shu, Michelle
    Lyu, Pengyuan
    Li, Ruiyu
    Zhou, Chao
    Shen, Xiaoyong
    Jia, Jiaya
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4229 - 4238
  • [14] TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection
    Wang, Fangfang
    Chen, Yifeng
    Wu, Fei
    Li, Xi
    [J]. MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 111 - 119
  • [15] Weakly Supervised Adversarial Domain Adaptation for Semantic Segmentation in Urban Scenes
    Wang, Qi
    Gao, Junyu
    Li, Xuelong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (09) : 4376 - 4386
  • [16] Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
    Wang, Wenhai
    Xie, Enze
    Song, Xiaoge
    Zang, Yuhang
    Wang, Wenjia
    Lu, Tong
    Yu, Gang
    Shen, Chunhua
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8439 - 8448
  • [17] Wang YX, 2020, PROC CVPR IEEE, P11750, DOI 10.1109/CVPR42600.2020.01177
  • [18] TextField: Learning a Deep Direction Field for Irregular Scene Text Detection
    Xu, Yongchao
    Wang, Yukang
    Zhou, Wei
    Wang, Yongpan
    Yang, Zhibo
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (11) : 5566 - 5579
  • [19] A Unified Framework for Multioriented Text Detection and Recognition
    Yao, Cong
    Bai, Xiang
    Liu, Wenyu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (11) : 4737 - 4749
  • [20] Yao C, 2012, PROC CVPR IEEE, P1083, DOI 10.1109/CVPR.2012.6247787