A real-time arbitrary-shape text detector

被引:0
|
作者
Lu, Manhuai [1 ]
Li, Langlang [2 ]
Chen, Chin-Ling [3 ,4 ]
机构
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Coll Mech & Elect Engn, Zhongshan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Mech & Elect Engn, Chengdu, Peoples R China
[3] Changchun Sci Tech Univ, Sch Informat Engn, Changchun, Peoples R China
[4] Chaoyang Univ Technol, Dept Comp Sci & Informat Engn, Taichung, Taiwan
来源
PLOS ONE | 2024年 / 19卷 / 04期
关键词
D O I
10.1371/journal.pone.0302234
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
It is challenging to detect arbitrary-shape text accurately and effectively in natural scenes. While many methods have been implemented for arbitrary-shape text detection, most cannot achieve real-time detection or meet practical needs. In this work, we propose a YOLOv6-based detector that can effectively implement arbitrary-shape text detection and achieve real-time detection. We include two additional branches in the neck part of the YOLOv6 network to adapt the network to text detection, and the output side uses the pixel aggregation (PA) algorithm to decouple the PA output to use it as the detection head of the model. Experiments on benchmark Total-Text, CTW1500, ICDAR2015, and MSRA-TD500 showed that the proposed method outperformed competing methods in terms of detection accuracy and running time. Specifically, our method achieved an F-measure of 84.1% at 291.8 FPS for 640 x 640 Total-Text images and an F-measure of 81.5% at 199.6 FPS for 896 x 896 ICDAR2015 incidental text images.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Progressive Contour Regression for Arbitrary-Shape Scene Text Detection
    Dai, Pengwen
    Zhang, Sanyi
    Zhang, Hua
    Cao, Xiaochun
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7389 - 7398
  • [2] R-YOLO: A Real-Time Text Detector for Natural Scenes with Arbitrary Rotation
    Wang, Xiqi
    Zheng, Shunyi
    Zhang, Ce
    Li, Rui
    Gui, Li
    SENSORS, 2021, 21 (03) : 1 - 21
  • [3] MorphText: Deep Morphology Regularized Accurate Arbitrary-Shape Scene Text Detection
    Xu, Chengpei
    Jia, Wenjing
    Wang, Ruomei
    Luo, Xiaonan
    He, Xiangjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4199 - 4212
  • [4] OPMP: An Omnidirectional Pyramid Mask Proposal Network for Arbitrary-Shape Scene Text Detection
    Zhang, Sheng
    Liu, Yuliang
    Jin, Lianwen
    Wei, Zhongrong
    Shen, Chunhua
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 454 - 467
  • [5] STELA: A Real-Time Scene Text Detector With Learned Anchor
    Deng, Linjie
    Gong, Yanxiang
    Lu, Xinchen
    Lin, Yi
    Ma, Zheng
    Xie, Mei
    IEEE ACCESS, 2019, 7 : 153400 - 153407
  • [6] Arbitrary-Shape Scene Text Detection via Visual-Relational Rectification and Contour Approximation
    Xu, Chengpei
    Jia, Wenjing
    Cui, Tingcheng
    Wang, Ruomei
    Zhang, Yuan-fang
    He, Xiangjian
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4052 - 4066
  • [7] IoU-Related Arbitrary Shape Text Scoring Detector
    Liu, Fagui
    Gu, Dian
    Chen, Cheng
    IEEE ACCESS, 2019, 7 : 180428 - 180437
  • [8] Arbitrary-shape transformation multiphysics cloak by topology optimization
    Zhu, Zhan
    Wang, Zhaochen
    Liu, Tianfeng
    Xie, Bin
    Luo, Xiaobing
    Choi, Wonjoon
    Hu, Run
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2024, 222
  • [9] A Real-Time Deformable Detector
    Ali, Karim
    Fleuret, Francois
    Hasler, David
    Fua, Pascal
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (02) : 225 - 239
  • [10] A real-time face detector
    Zhang, SC
    Liu, ZQ
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 2197 - 2202