Toward Arbitrary-Shaped Text Spotting Based on End-to-End

被引:4
|
作者
Wei, Guangcun [1 ,2 ]
Rong, Wansheng [1 ]
Liang, Yongquan [1 ]
Xiao, Xinguang [1 ]
Liu, Xiang [1 ]
机构
[1] Shandong Univ Sci & Technol, Coll Comp Sci & Engn, Qingdao 266590, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Intelligent Equipment, Tai An 271019, Shandong, Peoples R China
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Text recognition; Feature extraction; Task analysis; Detectors; Optimization; Convolution; Optical character recognition software; Natural scene text spotting; SA-BiLSTM; end-to-end; joint optimization; SCENE TEXT; RECOGNITION;
D O I
10.1109/ACCESS.2020.3020387
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
At present, text spotting in natural scenes has become one of the research hotspots. Among them, curvilinear text and long text are the main difficulties of text spotting in natural scenes. To better solve these two types of problems, we propose a novel end-to-end text spotting model. The model includes three parts: shared convolution module, text detector module and text recognizer module. For the problem of long text, we adopt the corner attention mechanism to extract the features of long text more effectively. For the problem of curve text, we feed the rectification feature map into the SA-BiLSTM decoder to recognize the curve text more effectively. More importantly, the joint optimization strategy realizes the mutual promotion function of the text detection task and the text recognition task. Experimental results on TotalText, ICDAR2015, ICDAR2013, CTW1500, COCO-Text and MLT datasets prove that our method achieves excellent performance and robustness in text spotting tasks based on end-to-end natural scenes.
引用
收藏
页码:159906 / 159914
页数:9
相关论文
共 50 条
  • [41] End-to-End Chinese Image Text Recognition with Attention Model
    Sheng, Fenfen
    Zhai, Chuanlei
    Chen, Zhineng
    Xu, Bo
    NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 180 - 189
  • [42] Multitask Training with Text Data for End-to-End Speech Recognition
    Wang, Peidong
    Sainath, Tara N.
    Weiss, Ron J.
    INTERSPEECH 2021, 2021, : 2566 - 2570
  • [43] An End-to-End System for Text Extraction in Indian Identity Cards
    Kedlaya, Arjun S.
    Amudha, J.
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION NETWORKS (ICCCN 2021), 2022, 394 : 427 - 436
  • [44] aDCF Loss Function for Deep Metric Learning in End-to-End Text-Dependent Speaker Verification Systems
    Mingote, Victoria
    Miguel, Antonio
    Ribas, Dayana
    Ortega, Alfonso
    Lleida, Eduardo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 772 - 784
  • [45] An End-to-End Anti-Jamming Target Detection Method Based on CNN
    Zhang, Yu
    Jiu, Bo
    Wang, Penghui
    Liu, Hongwei
    Liang, Siyuan
    IEEE SENSORS JOURNAL, 2021, 21 (19) : 21817 - 21828
  • [46] A Robust and Accurate End-to-End Template Matching Method Based on the Siamese Network
    Ren, Qiang
    Zheng, Yongbin
    Sun, Peng
    Xu, Wanying
    Zhu, Di
    Yang, Dongxu
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [47] Neural Network Based End-to-End Query by Example Spoken Term Detection
    Ram, Dhananjay
    Miculicich, Lesly
    Bourlard, Herve
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 1416 - 1427
  • [48] Two End-to-End Quantum-Inspired Deep Neural Networks for Text Classification
    Shi, Jinjing
    Li, Zhenhuan
    Lai, Wei
    Li, Fangfang
    Shi, Ronghua
    Feng, Yanyan
    Zhang, Shichao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) : 4335 - 4345
  • [49] Toward Explainable End-to-End Driving Models via Simplified Objectification Constraints
    Zhang, Chenkai
    Deguchi, Daisuke
    Chen, Jialei
    Murase, Hiroshi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (10) : 14521 - 14534
  • [50] Robust End-to-End Offline Chinese Handwriting Text Page Spotter with Text Kernel
    Wang, Zhihao
    Yu, Yanwei
    Wang, Yibo
    Long, Haixu
    Wang, Fazheng
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT II, 2021, 12917 : 21 - 35