A Graph-Transformer Network for Scene Text Detection

被引:0
|
作者
Wu, Yongrong [1 ]
Lin, Jingyu [1 ]
Chen, Houjin [1 ]
Chen, Dinghao [1 ]
Yang, Lvqing [1 ]
Xiahou, Jianbing [2 ]
机构
[1] Xiamen Univ, Sch Informat, Xiamen 361000, Peoples R China
[2] Quanzhou Normal Univ, Quanzhou 362000, Fujian, Peoples R China
来源
ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V | 2023年 / 14090卷
关键词
Scene Text Detection; Transformer; Graph convolutional network;
D O I
10.1007/978-981-99-4761-4_57
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting text in natural images with varying orientations and shapes is challenging. Existing detectors often fail with text instances having extreme aspect ratios. This paper introduces GTNet, a Graph- Transformer network for scene text detection. GTNet uses a Graph-based Shared Feature Learning Module (GSFL) for feature extraction and a Transformer-based Regression Module (TRM) for bounding box prediction. Our architecture offers a flexible receptive field, combining global attention and local features for enhanced text representation. Extensive experiments show our method surpasses existing detectors in accuracy and effectiveness.
引用
收藏
页码:680 / 690
页数:11
相关论文
共 50 条
  • [1] Transformer and Graph Convolutional Network for Text Classification
    Liu, Boting
    Guan, Weili
    Yang, Changjin
    Fang, Zhijie
    Lu, Zhiheng
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)
  • [2] Transformer and Graph Convolutional Network for Text Classification
    Boting Liu
    Weili Guan
    Changjin Yang
    Zhijie Fang
    Zhiheng Lu
    International Journal of Computational Intelligence Systems, 16
  • [3] TransText: Improving scene text detection via transformer
    Zhu, Jiajun
    Wang, Guodong
    DIGITAL SIGNAL PROCESSING, 2022, 130
  • [4] Multi-scale graph-transformer network for trajectory prediction of the autonomous vehicles
    Singh, Divya
    Srivastava, Rajeev
    INTELLIGENT SERVICE ROBOTICS, 2022, 15 (03) : 307 - 320
  • [5] Multi-scale graph-transformer network for trajectory prediction of the autonomous vehicles
    Divya Singh
    Rajeev Srivastava
    Intelligent Service Robotics, 2022, 15 : 307 - 320
  • [6] A Graph-Transformer Method for Landslide Susceptibility Mapping
    Zhang, Qing
    He, Yi
    Zhang, Yalei
    Lu, Jiangang
    Zhang, Lifeng
    Huo, Tianbao
    Tang, Jiapeng
    Fang, Yumin
    Zhang, Yunhao
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 14556 - 14574
  • [7] HGR-Net: Hierarchical Graph Reasoning Network for Arbitrary Shape Scene Text Detection
    Bi, Hengyue
    Xu, Canhui
    Shi, Cao
    Liu, Guozhu
    Zhang, Honghong
    Li, Yuteng
    Dong, Junyu
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 4142 - 4155
  • [8] Deep Residual Text Detection Network for Scene Text
    Zhu, Xiangyu
    Jiang, Yingying
    Yang, Shuli
    Wang, Xiaobing
    Li, Wei
    Fu, Pei
    Wang, Hua
    Luo, Zhenbo
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 807 - 812
  • [9] MuST: Multimodal Spatiotemporal Graph-Transformer for Hospital Readmission Prediction
    Miao, Yan
    Yu, Lequan
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2023 WORKSHOPS, 2023, 14394 : 276 - 285
  • [10] Refinement Correction Network for Scene Text Detection
    Lian, Zhe
    Yin, Yanjun
    Hu, Wei
    Xu, Qiaozhi
    Zhi, Min
    Lu, Jingfang
    Qi, Xuanhao
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT VIII, ICIC 2024, 2024, 14869 : 93 - 105