A Graph-Transformer Network for Scene Text Detection

被引:0
|
作者
Wu, Yongrong [1 ]
Lin, Jingyu [1 ]
Chen, Houjin [1 ]
Chen, Dinghao [1 ]
Yang, Lvqing [1 ]
Xiahou, Jianbing [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen 361000, Peoples R China
[2] Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V | 2023年 / 14090卷
关键词
Scene Text Detection; Transformer; Graph convolutional network;
D O I
10.1007/978-981-99-4761-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting text in natural images with varying orientations and shapes is challenging. Existing detectors often fail with text instances having extreme aspect ratios. This paper introduces GTNet, a Graph- Transformer network for scene text detection. GTNet uses a Graph-based Shared Feature Learning Module (GSFL) for feature extraction and a Transformer-based Regression Module (TRM) for bounding box prediction. Our architecture offers a flexible receptive field, combining global attention and local features for enhanced text representation. Extensive experiments show our method surpasses existing detectors in accuracy and effectiveness.
引用
收藏
页码:680 / 690
页数:11
相关论文
共 50 条
  • [31] Image-text fusion transformer network for sarcasm detection
    Liu, Jing
    Tian, Shengwei
    Yu, Long
    Shi, Xianwei
    Wang, Fan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (14) : 41895 - 41909
  • [32] Transformer-Convolution Network for Arbitrary Shape Text Detection
    Hu, Yucheng
    Zhang, Yuting
    Yu, Wenxin
    Lan, Tianxiang
    Yin, Dong
    PROCEEDINGS OF 2022 THE 6TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND SOFT COMPUTING, ICMLSC 20222, 2022, : 120 - 126
  • [33] Image-text fusion transformer network for sarcasm detection
    Jing Liu
    Shengwei Tian
    Long Yu
    Xianwei Shi
    Fan Wang
    Multimedia Tools and Applications, 2024, 83 : 41895 - 41909
  • [34] RNGDet: Road Network Graph Detection by Transformer in Aerial Images
    Xu, Zhenhua
    Liu, Yuxuan
    Gan, Lu
    Sun, Yuxiang
    Wu, Xinyu
    Liu, Ming
    Wang, Lujia
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [35] STR Transformer: A Cross-domain Transformer for Scene Text Recognition
    Wu, Xing
    Tang, Bin
    Zhao, Ming
    Wang, Jianjia
    Guo, Yike
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3444 - 3458
  • [36] STR Transformer: A Cross-domain Transformer for Scene Text Recognition
    Xing Wu
    Bin Tang
    Ming Zhao
    Jianjia Wang
    Yike Guo
    Applied Intelligence, 2023, 53 : 3444 - 3458
  • [37] A Novel Scene Text Detection Algorithm Based On Convolutional Neural Network
    Ren, Xiaohang
    Chen, Kai
    Yang, Xiaokang
    Zhou, Yi
    He, Jianhua
    Sun, Jun
    2016 30TH ANNIVERSARY OF VISUAL COMMUNICATION AND IMAGE PROCESSING (VCIP), 2016,
  • [38] Mutually Guided Dual-Task Network for Scene Text Detection
    Zhao, Mengbiao
    Feng, Wei
    Yin, Fei
    Zhang, Xu-Yao
    Liu, Cheng-Lin
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 6928 - 6934
  • [39] Towards Accurate Scene Text Detection with Bidirectional Feature Pyramid Network
    Cao, Dongping
    Dang, Jiachen
    Zhong, Yong
    SYMMETRY-BASEL, 2021, 13 (03):
  • [40] Instance Segmentation Network With Self-Distillation for Scene Text Detection
    Yang, Peng
    Yang, Guowei
    Gong, Xun
    Wu, Pingping
    Han, Xu
    Wu, Jiasong
    Chen, Caisen
    IEEE ACCESS, 2020, 8 : 45825 - 45836