An efficient and universal polygon prediction method based on derivable analytic geometry for arbitrary-shaped text detection

被引:0
|
作者
Zhang, Xiangnan [1 ]
Tian, Chunna [1 ]
Gao, Xinbo [2 ]
机构
[1] Xidian Univ, Sch Elect Engn, Xian 710071, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
VISUAL COMPUTER | 2024年 / 40卷 / 06期
基金
中国国家自然科学基金;
关键词
Text detection; Intersection over union; Curved text; Polygon prediction;
D O I
10.1007/s00371-023-03081-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A polygon can represent the boundary of curved text more compactly than a rectangle. However, predicting reasonable polygon lacks of solutions due to the complex spatial relationships caused by having more vertices. The two main challenges are how to satisfy the constraints between vertices and how to cope with data conflicts caused by inconsistent annotation standards. To address these problems, we propose a divide and conquer methodology, in which a polygon is considered as a set of convex quadrangles. By predicting quadrangles in sequence, the vertices of the polygon are obtained consecutively and constrained by the previous ones. Then, we propose a measure for the overlap between convex quadrangles, with which the IoU between two polygons is calculated densely. Our method is derivable and can be trained end-to-end. Also, the polygon prediction branch that we proposed is universal and transplantable. We select basic architecture as the backbone, and the text/non-text classification branch adopts an online hard example mining strategy. Experiments on curved benchmark datasets, namely Total Text and CTW1500, demonstrate that our approach achieves state-of-the-art accuracy. It also maintains a high level of inferring efficiency.
引用
收藏
页码:4273 / 4285
页数:13
相关论文
共 50 条
  • [1] Bidirectional Regression for Arbitrary-Shaped Text Detection
    Sheng, Tao
    Lian, Zhouhui
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 187 - 201
  • [2] Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
    Wang, Wenhai
    Xie, Enze
    Song, Xiaoge
    Zang, Yuhang
    Wang, Wenjia
    Lu, Tong
    Yu, Gang
    Shen, Chunhua
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 8439 - 8448
  • [3] Arbitrary-Shaped Text Detection With Adaptive Text Region Representation
    Jiang, Xiufeng
    Xu, Shugong
    Zhang, Shunqing
    Cao, Shan
    IEEE ACCESS, 2020, 8 : 102106 - 102118
  • [4] Fuzzy Semantics for Arbitrary-Shaped Scene Text Detection
    Wang, Fangfang
    Xu, Xiaogang
    Chen, Yifeng
    Li, Xi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 1 - 12
  • [5] Fourier Contour Embedding for Arbitrary-Shaped Text Detection
    Zhu, Yiqin
    Chen, Jianyong
    Liang, Lingyu
    Kuang, Zhanghui
    Jin, Lianwen
    Zhang, Wayne
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 3122 - 3130
  • [6] Wavelet descriptor network for arbitrary-shaped text detection
    Zhang, Zixu
    Tong, Minglei
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [7] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Honghui Chen
    Yuhang Qiu
    Mengxi Jiang
    Jianhui Lin
    Pingping Chen
    Complex & Intelligent Systems, 2024, 10 : 75 - 86
  • [8] Kernel-mask knowledge distillation for efficient and accurate arbitrary-shaped text detection
    Chen, Honghui
    Qiu, Yuhang
    Jiang, Mengxi
    Lin, Jianhui
    Chen, Pingping
    COMPLEX & INTELLIGENT SYSTEMS, 2024, 10 (01) : 75 - 86
  • [9] Arbitrary-shaped scene text detection with keypoint-based shape representation
    Shuxin Qin
    Lin Chen
    International Journal on Document Analysis and Recognition (IJDAR), 2022, 25 : 115 - 127
  • [10] Arbitrary-shaped scene text detection by predicting distance map
    Xinyu Wang
    Yaohua Yi
    Jibing Peng
    Kaili Wang
    Applied Intelligence, 2022, 52 : 14374 - 14386