BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK

被引：6

作者：

Yang, Chuang

Chen, Mulin

Yuan, Yuan

Wang, Qi ^{[1
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China

来源：

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022年

基金：

中国国家自然科学基金;

关键词：

Arbitrary-shaped text detection; scene text detection; real-time text detector; computer vision;

D O I：

10.1109/ICASSP43922.2022.9747331

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Detecting irregular-shaped text instances is the main challenge for text detection. Existing approaches can be roughly divided into top-down and bottom-up perspective methods. The former encodes text contours into unified units, which always fails to fit highly curved text contours. The latter represents text instances by a number of local units, where the complicated network and post-processing lead to slow detection speed. In this paper, to detect arbitrary-shaped text instances with high detection accuracy and speed simultaneously, we propose a Bidirectional Perspective strategy based Network (BiP-Net). Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively. Moreover, a contour connecting (CC) algorithm is proposed to avoid the information loss of text contours by rebuilding interval contours from a bottom-up perspective. The experimental results on MSRA-TD500, CTW1500, and ICDAR2015 datasets demonstrate the superiority of BiP-Net against several state-of-the-art methods.

引用

页码：2255 / 2259

页数：5

共 27 条

[1] Character Region Awareness for Text Detection
Baek, Youngmin
Lee, Bado
Han, Dongyoon
Yun, Sangdoo
Lee, Hwalsuk
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9357 - 9366
[2] Cao M, 2020, INT CONF ACOUST SPEE, P2228, DOI [10.1109/ICASSP40776.2020.9053679, 10.1109/icassp40776.2020.9053679]
[3] External Attention Based TransUNet and Label Expansion Strategy for Crack Detection
Fang, Jie
Yang, Chen
Shi, Yuetian
Wang, Nan
Zhao, Yang
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 19054 - 19063
[4] TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting
Feng, Wei
He, Wenhao
Yin, Fei
Zhang, Xu-Yao
Liu, Cheng-Lin
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9075 - 9084
[5] Synthetic Data for Text Localisation in Natural Images
Gupta, Ankush
Vedaldi, Andrea
Zisserman, Andrew
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2315 - 2324
[6] Karatzas D, 2015, PROC INT CONF DOC, P1156, DOI 10.1109/ICDAR.2015.7333942
[7] CornerNet: Detecting Objects as Paired Keypoints
Law, Hei
Deng, Jia
[J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 765 - 781
[8] Li XL, 2017, AAAI CONF ARTIF INTE, P4147
[9] Rotation-sensitive Regression for Oriented Scene Text Detection
Liao, Minghui
Zhu, Zhen
Shi, Baoguang
Xia, Gui-song
Bai, Xiang
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5909 - 5918
[10] Liu Y. L., 2017, CoRR abs/1712.02170

← 1 2 3 →