A real-time arbitrary-shape text detector

Authors
Lu, Manhuai [1 ]
Li, Langlang [2 ]
Chen, Chin-Ling [3 ,4 ]
Affiliations
[1] Univ Elect Sci & Technol China, Zhongshan Inst, Coll Mech & Elect Engn, Zhongshan, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Mech & Elect Engn, Chengdu, Peoples R China
[3] Changchun Sci Tech Univ, Sch Informat Engn, Changchun, Peoples R China
[4] Chaoyang Univ Technol, Dept Comp Sci & Informat Engn, Taichung, Taiwan
Source
PLOS ONE | 2024 / Vol. 19 / Issue 04
DOI
10.1371/journal.pone.0302234
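Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract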
It is challenging to detect arbitrary-shape text accurately and effectively in natural scenes. While many methods have been implemented for arbitrary-shape text detection, most cannot achieve real-time detection or meet practical needs. In this work, we propose a YOLOv6-based detector that can effectively implement arbitrary-shape text detection and achieve real-time detection. We include two additional branches in the neck part of the YOLOv6 network to adapt the network to text detection, and the output side uses the pixel aggregation (PA) algorithm to decouple the PA output to use it as the detection head of the model. Experiments on the Total-Text, CTW1500, ICDAR2015, and MSRA-TD500 benchmarks showed that the proposed method outperformed competing methods in terms of detection accuracy and running time. Specifically, our method achieved an F-measure of 84.1% at 291.8 FPS for 640 × 640 Total-Text images and an F-measure of 81.5% at 199.6 FPS for 896 × 896 ICDAR2015 incidental text images.
Pages: 19