Margin Guidance Network for Arbitrary-shaped Scene Text Detection

Cited by: 0
Authors
Li, Xin [1]
Wu, Xingjiao [1]
Ma, Tianlong [1]
Zhou, Zhao [2]
Chen, Luhui [2]
He, Liang [1]
Affiliations
[1] East China Normal Univ, Shanghai, Peoples R China
[2] Videt Tech Ltd, Shanghai, Peoples R China
Source
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2020
Keywords
Scene text detection; Margin Guidance Network; arbitrary-shaped text;
DOI
10.1109/ICTAI50040.2020.00169
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Segmentation-based scene text detection approaches have been applied to arbitrary-shaped texts and have achieved great progress. However, false detections easily occur when arbitrary-shaped texts are close to each other. In this paper, we propose the Margin Guidance Network (MGN), which is mainly based on the margin constraint residual module (MCRM), to address the aforementioned problem. The MCRM considers the margins between multiple text instance masks to guide the training of the network and improve text detection performance. The MCRM contains two prediction branches: one generates multiple masks of different scales for a text instance, and the other generates the margins between these masks. Experimental results on three public benchmarks, ICDAR2015, CTW1500 and Total-Text, demonstrate that the proposed MGN achieves state-of-the-art results.
Pages: 1111 - 1117
Page count: 7
Related papers
50 in total
  • [11] Cross-Level Attention Based Adaptive Feature Alignment Network for Arbitrary-Shaped Text Detection
    Zhang, Haiyan
    Li, Sumei
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2243 - 2248
  • [12] Learning Pixel Affinity Pyramid for Arbitrary-Shaped Text Detection
    Fu, Zilong
    Xie, Hongtao
    Fang, Shancheng
    Wang, Yuxin
    Xing, Mengting
    Zhang, Yongdong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [13] BIP-NET: BIDIRECTIONAL PERSPECTIVE STRATEGY BASED ARBITRARY-SHAPED TEXT DETECTION NETWORK
    Yang, Chuang
    Chen, Mulin
    Yuan, Yuan
    Wang, Qi
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 2255 - 2259
  • [14] Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images
    Guo, Youhui
    Zhou, Yu
    Qin, Xugong
    Wang, Weiping
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2021, PT V, 2021, 12895 : 271 - 283
  • [15] Region-Aware Arbitrary-Shaped Text Detection With Progressive Fusion
    Wang, Qitong
    Fu, Bin
    Li, Ming
    He, Junjun
    Peng, Xi
    Qiao, Yu
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 4718 - 4729
  • [16] POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING
    Zhang, Yi
    Yang, Wei
    Xu, Zhenbo
    Li, Yingjie
    Chen, Zhi
    Huang, Liusheng
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2375 - 2379
  • [17] CM-Net: Concentric Mask Based Arbitrary-Shaped Text Detection
    Yang, Chuang
    Chen, Mulin
    Xiong, Zhitong
    Yuan, Yuan
    Wang, Qi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 2864 - 2877
  • [18] CT-Net: Arbitrary-Shaped Text Detection via Contour Transformer
    Shao, Zhiwen
    Su, Yuchen
    Zhou, Yong
    Meng, Fanrong
    Zhu, Hancheng
    Liu, Bing
    Yao, Rui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1815 - 1826
  • [19] SegLink++: Detecting Dense and Arbitrary-shaped Scene Text by Instance-aware Component Grouping
    Tang, Jun
    Yang, Zhibo
    Wang, Yongpan
    Zheng, Qi
    Xu, Yongchao
    Bai, Xiang
    PATTERN RECOGNITION, 2019, 96
  • [20] UTextNet: A UNet Based Arbitrary Shaped Scene Text Detector
    Naosekpam, Veronica
    Aggarwal, Sushant
    Sahu, Nilkanta
    INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 368 - 378