Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引:4
|
作者
Wei, Guangcun [1 ,2 ]
Rong, Wansheng [1 ]
Liang, Yongquan [1 ]
Xiao, Xinguang [1 ]
Liu, Xiang [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
D O I
10.1109/ACCESS.2020.3020387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.
引用
收藏
页码:159906 / 159914
页数:9
相关论文
共 50 条
  • [1] Boundary TextSpotter: Toward Arbitrary-Shaped Scene Text Spotting
    Lu, Pu
    Wang, Hao
    Zhu, Shenggao
    Wang, Jing
    Bai, Xiang
    Liu, Wenyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 6200 - 6212
  • [2] Scene text spotting based on end-to-end
    Wei G.
    Rong W.
    Liang Y.
    Xiao X.
    Liu X.
    Journal of Intelligent and Fuzzy Systems, 2021, 40 (05) : 8871 - 8881
  • [3] Towards End-to-End Text Spotting in Natural Scenes
    Wang, Peng
    Li, Hui
    Shen, Chunhua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7266 - 7281
  • [4] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
    Zhang, Yi
    Yang, Wei
    Xu, Zhenbo
    Li, Yingjie
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
  • [5] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
    Liao, Minghui
    Lyu, Pengyuan
    He, Minghang
    Yao, Cong
    Wu, Wenhao
    Bai, Xiang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (02) : 532 - 548
  • [6] PAN plus plus : Towards Efficient and Accurate End-to-End Spotting of Arbitrarily-Shaped Text
    Wang, Wenhai
    Xie, Enze
    Li, Xiang
    Liu, Xuebo
    Liang, Ding
    Zhibo, Yang
    Lu, Tong
    Shen, Chunhua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (09) : 5349 - 5367
  • [7] End-to-End Video Text Spotting with Transformer
    Wu, Weijia
    Cai, Yuanqiang
    Shen, Chunhua
    Zhang, Debing
    Fu, Ying
    Zhou, Hong
    Luo, Ping
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (09) : 4019 - 4035
  • [8] FREE: A Fast and Robust End-to-End Video Text Spotter
    Cheng, Zhanzhan
    Lu, Jing
    Zou, Baorui
    Qiao, Liang
    Xu, Yunlu
    Pu, Shiliang
    Niu, Yi
    Wu, Fei
    Zhou, Shuigeng
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 822 - 837
  • [9] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
    Lyu, Pengyuan
    Liao, Minghui
    Yao, Cong
    Wu, Wenhao
    Bai, Xiang
    COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 71 - 88
  • [10] ADATS: Adaptive RoI-Align based Transformer for End-to-End Text Spotting
    Huang, Zepeng
    Wan, Qi
    Chen, Junliang
    Zhao, Xiaodong
    Ye, Kai
    Shen, Linlin
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1403 - 1408