A Graph-Transformer Network for Scene Text Detection

被引:0
|
作者
Wu, Yongrong [1 ]
Lin, Jingyu [1 ]
Chen, Houjin [1 ]
Chen, Dinghao [1 ]
Yang, Lvqing [1 ]
Xiahou, Jianbing [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen 361000, Peoples R China
[2] Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V | 2023年 / 14090卷
关键词
Scene Text Detection; Transformer; Graph convolutional network;
D O I
10.1007/978-981-99-4761-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting text in natural images with varying orientations and shapes is challenging. Existing detectors often fail with text instances having extreme aspect ratios. This paper introduces GTNet, a Graph- Transformer network for scene text detection. GTNet uses a Graph-based Shared Feature Learning Module (GSFL) for feature extraction and a Transformer-based Regression Module (TRM) for bounding box prediction. Our architecture offers a flexible receptive field, combining global attention and local features for enhanced text representation. Extensive experiments show our method surpasses existing detectors in accuracy and effectiveness.
引用
收藏
页码:680 / 690
页数:11
相关论文
共 50 条
  • [21] A Hierarchical Graph-Enhanced Transformer Network for Remote Sensing Scene Classification
    Li, Ziwei
    Xu, Weiming
    Yang, Shiyu
    Wang, Juan
    Su, Hua
    Huang, Zhanchao
    Wu, Sheng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 20315 - 20330
  • [22] A Text-Specific Domain Adaptive Network for Scene Text Detection in the Wild
    He, Xuan
    Yuan, Jin
    Li, Mengyao
    Wang, Runmin
    Wang, Haidong
    Li, Zhiyong
    APPLIED INTELLIGENCE, 2023, 53 (22) : 26827 - 26839
  • [23] Combining Swin Transformer and Attention-Weighted Fusion for Scene Text Detection
    Li, Xianguo
    Yao, Xingchen
    Liu, Yi
    NEURAL PROCESSING LETTERS, 2024, 56 (02)
  • [24] Combining Swin Transformer and Attention-Weighted Fusion for Scene Text Detection
    Xianguo Li
    Xingchen Yao
    Yi Liu
    Neural Processing Letters, 56
  • [25] FTPN: Scene Text Detection With Feature Pyramid Based Text Proposal Network
    Liu, Fagui
    Chen, Cheng
    Gu, Dian
    Zheng, Jingzhong
    IEEE ACCESS, 2019, 7 : 44219 - 44228
  • [26] GCCNet: Grouped channel composition network for scene text detection
    Liu, Chang
    Yang, Chun
    Hou, Jie-Bo
    Wu, Long-Huang
    Zhu, Xiao-Bin
    Xiao, Lei
    Yin, Xu-Cheng
    NEUROCOMPUTING, 2021, 454 : 135 - 151
  • [27] Holistic Vertical Regional Proposal Network for Scene Text Detection
    Ehen, Xu
    Guo, Qiang
    Li, Shuohao
    Zhang, Jun
    2017 2ND INTERNATIONAL CONFERENCE ON IMAGE, VISION AND COMPUTING (ICIVC 2017), 2017, : 72 - 77
  • [28] Lightweight Scene Text Recognition Based on Transformer
    Luan, Xin
    Zhang, Jinwei
    Xu, Miaomiao
    Silamu, Wushouer
    Li, Yanbing
    SENSORS, 2023, 23 (09)
  • [29] ESRNet: an exploring sample relationships network for arbitrary-shaped scene text detection
    Fan, Huageng
    Lu, Tongwei
    APPLIED INTELLIGENCE, 2024, 54 (22) : 11995 - 12008
  • [30] Conceptual text region network: Cognition-inspired accurate scene text detection
    Cui, Chenwei
    Lu, Liangfu
    Tan, Zhiyuan
    Hussain, Amir
    NEUROCOMPUTING, 2021, 464 : 252 - 264