Scene text spotting based on end-to-end

Authors
Wei G. [1 ,2 ]
Rong W. [1 ]
Liang Y. [1 ]
Xiao X. [1 ]
Liu X. [1 ]
Affiliations
[1] College of Computer Science and Engineering, Shandong University of Science and Technology, Shandong, Qingdao
[2] College of Intelligent Equipment, Shandong University of Science and Technology, Shandong, Taian
Keywords
End-to-end; Joint optimization; SAM-BiLSTM; Scene text spotting; TCM
DOI
10.3233/JIFS-200903
Chinese Library Classification (CLC) number
TN911 [Communication theory]
Subject classification code
081002
Abstract
To address the problem that traditional OCR pipelines ignore the inherent connection between the text detection task and the text recognition task, this paper proposes a novel end-to-end text spotting framework. The framework consists of three parts: a shared convolutional feature network, a text detector, and a text recognizer. Because the convolutional feature network is shared, the text detection network and the text recognition network can be jointly optimized. This reduces the computational burden and makes effective use of the inherent connection between text detection and text recognition. The model adds a TCM (Text Context Module) to Mask R-CNN, which can effectively address the negative sample problem in text detection tasks. The paper also proposes a text recognition model based on SAM-BiLSTM (a spatial attention mechanism with BiLSTM), which extracts the semantic information between characters more effectively. The model significantly surpasses state-of-the-art methods on a number of text detection and text spotting benchmarks, including ICDAR 2015 and Total-Text. © 2021 - IOS Press. All rights reserved.
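The shared-feature idea described in the abstract can be illustrated with a minimal toy sketch. This is not the paper's implementation: the class `SharedBackbone`, the placeholder loss functions, the weighted-sum form of the joint objective, and the `rec_weight` parameter are all assumptions introduced here for illustration. The abstract only states that the detector and recognizer share one convolutional feature network and are optimized jointly.

```python
from dataclasses import dataclass


@dataclass
class SharedBackbone:
    """Hypothetical stand-in for the shared convolutional feature network."""
    calls: int = 0

    def __call__(self, image):
        self.calls += 1
        # Toy "feature map": one value per image row (placeholder for conv features).
        return [sum(row) for row in image]


def detection_loss(features, target):
    # Placeholder mean squared error standing in for the detector's loss.
    return sum((f - t) ** 2 for f, t in zip(features, target)) / len(features)


def recognition_loss(features, target):
    # Placeholder mean absolute error standing in for the recognizer's loss.
    return sum(abs(f - t) for f, t in zip(features, target)) / len(features)


def joint_loss(backbone, image, det_target, rec_target, rec_weight=1.0):
    # Features are computed once and consumed by both task heads.
    features = backbone(image)
    # Assumed form of the joint objective: a weighted sum of the two task losses.
    return (detection_loss(features, det_target)
            + rec_weight * recognition_loss(features, rec_target))


backbone = SharedBackbone()
loss = joint_loss(backbone, [[1, 2], [3, 4]], [3, 7], [2, 9], rec_weight=0.5)
```

In this sketch the backbone runs once per image even though two losses are evaluated, which is the source of the computational saving the abstract mentions. In a real model, the gradient of the combined loss would update the shared features from both tasks.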
Pages: 8871-8881
Number of pages: 10