Refinement Correction Network for Scene Text Detection

被引:0
|
作者
Lian, Zhe [1 ]
Yin, Yanjun [1 ]
Hu, Wei [1 ]
Xu, Qiaozhi [1 ]
Zhi, Min [1 ]
Lu, Jingfang [1 ]
Qi, Xuanhao [1 ]
机构
[1] Inner Mongolia Normal Univ, Sch Comp Sci & Technol, Hohhot 010022, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024 | 2024年 / 14869卷
关键词
Scene text detection; Rough feature refinement; Clue feature correction; Differentiable binarization;
D O I
10.1007/978-981-97-5603-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In scene text detection, the accurate capture of underlying detail information and high-level semantic information are crucial for the accuracy and reliability of text detection. To this end, existing models primarily employ deep convolutional networks to extract semantic information from images. However, the multiple convolutions and downsampling operations in network lead to varying degrees of defects in shallow and deep features. To address this issue, this paper proposes the Refinement Correction Network (RCNet). Specifically, in the feature extraction process, constructing a Rough Feature Refinement Module (RFRM) based on the idea of image histogram equalization to restore the texture details of coarse results using underlying features. By modeling high-level features in multiple dimensions, a Clue Feature Correction Module (CFCM) is designed to enhance the semantic relevance of high-level features in spatial and channel positions. Experiments on four benchmark datasets validate the superiority of the proposed model over current technologies.
引用
收藏
页码:93 / 105
页数:13
相关论文
共 50 条
  • [31] BDFPN: Bi-Direction Feature Pyramid Network for Scene Text Detection
    Shao, Hai-Lin
    Ji, Yi
    Li, Ying
    Liu, Chun-Ping
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [32] Text proposals with location-awareness-attention network for arbitrarily shaped scene text detection and recognition
    Zhong, Dajian
    Lyu, Shujing
    Shivakumara, Palaiahankote
    Pal, Umapada
    Lu, Yue
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [33] Scene Text Detection Based on Text Stroke Components
    Hou, Xinyue
    Cheng, Pengsen
    Gao, Hongyu
    Li, Xin
    Liu, Jiayong
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2025, 35 (05)
  • [34] Multi-Dimension Aware Back Projection Network For Scene Text Detection
    Zhao, Yizhan
    Li, Sumei
    Chang, Yongli
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [35] Scene text detection using enhanced Extremal region and convolutional neural network
    Naiemi, Fatemeh
    Ghods, Vahid
    Khalesi, Hassan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) : 27137 - 27159
  • [36] A Detection and Verification Model Based on SSD and Encoder-Decoder Network for Scene Text Detection
    Gao, Xue
    Han, Siyi
    Luo, Cong
    IEEE ACCESS, 2019, 7 : 71299 - 71310
  • [37] Scene text detection using enhanced Extremal region and convolutional neural network
    Fatemeh Naiemi
    Vahid Ghods
    Hassan Khalesi
    Multimedia Tools and Applications, 2020, 79 : 27137 - 27159
  • [38] A Fast Method for Scene Text Detection
    Fang, Qing
    Yang, Yanping
    Chen, Yali
    Yao, Xiaoyu
    COMPUTER VISION, PT I, 2017, 771 : 738 - 747
  • [39] Using of Attention for Scene Text Detection
    Wang Y.
    Gu X.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2021, 33 (12): : 1908 - 1915
  • [40] Label Enhancement for Scene Text Detection
    MEI Junjun
    GUAN Tao
    TONG Junwen
    ZTE Communications, 2022, 20 (04) : 89 - 95