Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition

被引:12
作者
Zhong, Dajian [1 ]
Lyu, Shujing [1 ,2 ]
Shivakumara, Palaiahankote [3 ]
Pal, Umapada [4 ]
Lu, Yue [1 ,2 ]
机构
[1] East China Normal Univ, Shanghai Key Lab Multidimens Informat Proc, Shanghai 200241, Peoples R China
[2] East China Normal Univ, Sch Commun & Elect Engn, Shanghai 200241, Peoples R China
[3] Univ Malaya, Fac Comp Sci & Informat Technol FSKTM, Kuala Lumpur 50603, Malaysia
[4] Indian Stat Inst, CVPR Unit, Kolkata 700108, India
基金
中国国家自然科学基金;
关键词
Scene text detection; Scene text recognition; Text proposal; Attention model; Location-awareness-attention model; NEURAL-NETWORK; IMAGE;
D O I
10.1016/j.eswa.2022.117564
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Unlike existing models that aim to address the challenge of scene text detection and recognition separately, the proposed work aims to address both text detection and recognition using a single architecture to deal with arbitrarily oriented/shaped text. Towards this aim, a novel Text Proposal with Location-AwarenessAttention Network (TPLAANet) for arbitrarily oriented/shaped text detection and recognition is proposed. For text detection, the proposed method explores central mask prediction for locating text instances, bounding box regression branch for tight bounding boxes, and mask branch for accurate positions of arbitrarily oriented/shaped text instances. For text recognition, the proposed method explores character information using a Location-Awareness-Attention Network (LAAN), which learns a two-dimensional attention weight and improves the recognition performance. To test the efficacy of the proposed model, we consider the commonly used horizontal and multi-oriented natural scene text datasets, namely, ICDAR2013, ICDAR2015, and the arbitrarily shaped scene text datasets, namely, Total-Text and CTW1500 for experimentation. Experimental results are provided to validate the effectiveness of the proposed method. The code is available at: https: //codeocean.com/capsule/5666319/tree/v1.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Dual Relation Network for Scene Text Recognition
    Li, Ming
    Fu, Bin
    Chen, Han
    He, Junjun
    Qiao, Yu
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4094 - 4107
  • [42] Adaptive embedding gate for attention-based scene text recognition
    Chen, Xiaoxue
    Wang, Tianwei
    Zhu, Yuanzhi
    Jin, Lianwen
    Luo, Canjie
    [J]. NEUROCOMPUTING, 2020, 381 : 261 - 271
  • [43] SEMANTIC-COMPENSATED AND ATTENTION-GUIDED NETWORK FOR SCENE TEXT DETECTION
    Zhao, Yizhan
    Li, Sumei
    Li, Yueyang
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 939 - 943
  • [44] Text Enhancement Network for Cross-Domain Scene Text Detection
    Deng, Jinhong
    Luo, Xiulian
    Zheng, Jiawen
    Dang, Wanli
    Li, Wen
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2203 - 2207
  • [45] Scene Text Detection with Text Statistical Characteristics and Deep Neural Network
    Qu, Yanyun
    Yang, Xiaodong
    Lin, Li
    [J]. COMPUTER VISION, PT III, 2017, 773 : 245 - 254
  • [46] Scene text detection and recognition system for visually impaired people in real world
    Fei, Lei
    Wang, Kaiwei
    Lin, Shufei
    Yang, Kailun
    Cheng, Ruiqi
    Chen, Hao
    [J]. TARGET AND BACKGROUND SIGNATURES IV, 2018, 10794
  • [47] EFFICIENT SCENE TEXT DETECTION WITH TEXTUAL ATTENTION TOWER
    Zhang, Liang
    Liu, Yufei
    Xiao, Hang
    Yang, Lu
    Zhu, Guangming
    Shah, Syed Afaq
    Bennamou, Mohammed
    Shen, Peiyi
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4272 - 4276
  • [48] FEATURE FUSION NETWORK FOR SCENE TEXT DETECTION
    Cai, Chenqin
    Lv, Pin
    Su, Bing
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2755 - 2759
  • [49] Scene Text Detection Based On Fusion Network
    Zhao, Xuezhuan
    Zhou, Ziheng
    Li, Lingling
    Pei, Lishen
    Ye, Zhaoyi
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2021, 35 (10)
  • [50] Refinement Correction Network for Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Hu, Wei
    Xu, Qiaozhi
    Zhi, Min
    Lu, Jingfang
    Qi, Xuanhao
    [J]. ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024, 2024, 14869 : 93 - 105