BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK

Cited by: 6

Authors:

Yang, Chuang

Chen, Mulin

Yuan, Yuan

Wang, Qi [1]

Affiliations:

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Funding:

National Natural Science Foundation of China;

Keywords:

Arbitrary-shaped text detection; scene text detection; real-time text detector; computer vision;

DOI:

10.1109/ICASSP43922.2022.9747331

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

Detecting irregular-shaped text instances is the main challenge for text detection. Existing approaches can be roughly divided into top-down and bottom-up perspective methods. The former encodes text contours into unified units, which always fails to fit highly curved text contours. The latter represents text instances by a number of local units, where the complicated network and post-processing lead to slow detection speed. In this paper, to detect arbitrary-shaped text instances with high detection accuracy and speed simultaneously, we propose a Bidirectional Perspective strategy based Network (BiP-Net). Specifically, a new text representation strategy is proposed to represent text contours from a top-down perspective, which can fit highly curved text contours effectively. Moreover, a contour connecting (CC) algorithm is proposed to avoid the information loss of text contours by rebuilding interval contours from a bottom-up perspective. The experimental results on MSRA-TD500, CTW1500, and ICDAR2015 datasets demonstrate the superiority of BiP-Net against several state-of-the-art methods.

Pages: 2255-2259

Page count: 5
