Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引:4
|
作者
Wei, Guangcun [1 ,2 ]
Rong, Wansheng [1 ]
Liang, Yongquan [1 ]
Xiao, Xinguang [1 ]
Liu, Xiang [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
D O I
10.1109/ACCESS.2020.3020387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.
引用
收藏
页码:159906 / 159914
页数:9
相关论文
共 50 条
  • [21] Transformer-based end-to-end scene text recognition
    Zhu, Xinghao
    Zhang, Zhi
    PROCEEDINGS OF THE 2021 IEEE 16TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2021), 2021, : 1691 - 1695
  • [22] Cluttered TextSpotter: An End-to-End Trainable Light-Weight Scene Text Spotter for Cluttered Environment
    Bagi, Randheer
    Dutta, Tanima
    Gupta, Hari Prabhat
    IEEE ACCESS, 2020, 8 : 111433 - 111447
  • [23] RTNet: An End-to-End Method for Handwritten Text Image Translation
    Su, Tonghua
    Liu, Shuchen
    Zhou, Shengjie
    DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 99 - 113
  • [24] Emotion selectable end-to-end text-based speech editing
    Wang, Tao
    Yi, Jiangyan
    Fu, Ruibo
    Tao, Jianhua
    Wen, Zhengqi
    Zhang, Chu Yuan
    ARTIFICIAL INTELLIGENCE, 2024, 329
  • [25] Supervised Attention Network for Arbitrary-Shaped Text Detection in Edge-Fainted Noisy Scene Images
    Soni, Aishwarya
    Dutta, Tanima
    Nigam, Nitika
    Verma, Deepali
    Gupta, Hari Prabhat
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 1179 - 1188
  • [26] OctShuffleMLT: A Compact Octave Based Neural Network for End-to-End Multilingual Text Detection and Recognition
    Lundgren, Antonio
    Castro, Dayvid
    Lima, Estanislau
    Bezerra, Byron
    2019 INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION WORKSHOPS (ICDARW) AND 8TH INTERNATIONAL WORKSHOP ON CAMERA-BASED DOCUMENT ANALYSIS AND RECOGNITION, VOL 4, 2019, : 37 - 42
  • [27] End-to-End Speech Keyword Spotting Training Method Based on Sample's Class Uncertainty
    He, Qian-Hua
    Chen, Yong-Qiang
    Zheng, Ruo-Wei
    Huang, Jin-Xin
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (10): : 3482 - 3492
  • [28] TRIE: End-to-End Text Reading and Information Extraction for Document Understanding
    Zhang, Peng
    Xu, Yunlu
    Cheng, Zhanzhan
    Pu, Shiliang
    Lu, Jing
    Qiao, Liang
    Niu, Yi
    Wu, Fei
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1413 - 1422
  • [29] RMFPN: End-to-End Scene Text Recognition Using Multi-Feature Pyramid Network
    Mahadshetti, Ruturaj
    Lee, Guee-Sang
    Choi, Deok-Jai
    IEEE ACCESS, 2023, 11 : 61892 - 61900
  • [30] Towards End-to-End Speech-to-Text Summarization
    Monteiro, Raul
    Pernes, Diogo
    TEXT, SPEECH, AND DIALOGUE, TSD 2023, 2023, 14102 : 304 - 316