Refinement Correction Network for Scene Text Detection

被引:0
|
作者
Lian, Zhe [1 ]
Yin, Yanjun [1 ]
Hu, Wei [1 ]
Xu, Qiaozhi [1 ]
Zhi, Min [1 ]
Lu, Jingfang [1 ]
Qi, Xuanhao [1 ]
机构
[1] Inner Mongolia Normal Univ, Sch Comp Sci & Technol, Hohhot 010022, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024 | 2024年 / 14869卷
关键词
Scene text detection; Rough feature refinement; Clue feature correction; Differentiable binarization;
D O I
10.1007/978-981-97-5603-2_8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In scene text detection, the accurate capture of underlying detail information and high-level semantic information are crucial for the accuracy and reliability of text detection. To this end, existing models primarily employ deep convolutional networks to extract semantic information from images. However, the multiple convolutions and downsampling operations in network lead to varying degrees of defects in shallow and deep features. To address this issue, this paper proposes the Refinement Correction Network (RCNet). Specifically, in the feature extraction process, constructing a Rough Feature Refinement Module (RFRM) based on the idea of image histogram equalization to restore the texture details of coarse results using underlying features. By modeling high-level features in multiple dimensions, a Clue Feature Correction Module (CFCM) is designed to enhance the semantic relevance of high-level features in spatial and channel positions. Experiments on four benchmark datasets validate the superiority of the proposed model over current technologies.
引用
收藏
页码:93 / 105
页数:13
相关论文
共 50 条
  • [41] Scene Text Detection with Selected Anchors
    Zhu, Anna
    Du, Hang
    Xiong, Shengwu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6608 - 6615
  • [42] Scene Text Detection with Scribble Line
    Zhang, Wenqing
    Qiu, Yang
    Liao, Minghui
    Zhang, Rui
    Wei, Xiaolin
    Bai, Xiang
    DOCUMENT ANALYSIS AND RECOGNITION, ICDAR 2021, PT IV, 2021, 12824 : 79 - 94
  • [43] Elite Loss for scene text detection
    Zhao, Xu
    Zhao, Chaoyang
    Guo, Haiyun
    Zhu, Yousong
    Tang, Ming
    Wang, Jinqiao
    NEUROCOMPUTING, 2019, 333 : 284 - 291
  • [44] Scene Text Detection Based on Text Probability and Pruning Algorithm
    Zhou, Gang
    Liu, Yajun
    Shi, Fei
    Hu, Ying
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2016, PT III, 2016, 9773 : 726 - 735
  • [45] A Real-Time Scene Uyghur Text Detection Network Based on Feature Complementation
    Ibrayim, Mayire
    Chen, Mengmeng
    Hamdulla, Askar
    Kane, Jianjun
    Zhang, Chunhu
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT V, 2024, 14808 : 59 - 75
  • [46] GISCA: Gradient-Inductive Segmentation Network With Contextual Attention for Scene Text Detection
    Cao, Meng
    Zou, Yuexian
    Yang, Dongming
    Liu, Chao
    IEEE ACCESS, 2019, 7 : 62805 - 62816
  • [47] MFECN: Multi-level Feature Enhanced Cumulative Network for Scene Text Detection
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, 17 (03)
  • [48] Video Scene Text Frames Categorization for Text Detection and Recognition
    Qin, Longfei
    Shivakumara, Palaiahnakote
    Lu, Tong
    Pal, Umapada
    Tan, Chew Lim
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3886 - 3891
  • [49] FDTA: Fully Convolutional Scene Text Detection With Text Attention
    Cao, Yongcun
    Ma, Shuaisen
    Pan, Haichuan
    IEEE ACCESS, 2020, 8 : 155441 - 155449
  • [50] Contextual Text Block Detection Towards Scene Text Understanding
    Xue, Chuhui
    Huang, Jiaxing
    Zhang, Wenqing
    Lu, Shijian
    Wang, Changhu
    Bai, Song
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 374 - 391