DOCUMENT BINARIZATION WITH MULTI-BRANCH GATED CONVOLUTIONAL GENERATIVE ADVERSARIAL NETWORKS

被引:0
|
作者
Yang, Zongyuan [1 ]
Xiong, Yongping [1 ]
Wu, Guibin [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing 100876, Peoples R China
关键词
Document binarization; Gated convolution; Multi-scale fusion; Adversarial learning; DIBCO; COMPETITION;
D O I
10.1109/ICIP49359.2023.10222024
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing document binarization methods can not extract stroke edges finely, mainly due to the fair-treatment nature of vanilla convolutions and the extraction of stroke edges without adequate supervision by boundary-related information. In this paper, we formulate text extraction as the learning of gating values and propose a novel end-to-end gated convolutions-based network (GDB) to solve the problem of imprecise stroke edge extraction. The gated convolutions are applied to selectively extract the features of strokes with different attention. Firstly, a coarse sub-network with an extra edge branch is trained to get more precise feature maps by feeding a priori mask and edge. Secondly, a refinement sub-network is cascaded to refine the output of the first stage by gated convolutions based on the sharp edge. For global information, GDB also contains a multi-scale operation to combine local and global features. Experimental results show that our proposed methods outperform the SOTA methods in terms of all metrics on average over all DIBCO datasets from 2009 to 2019 and achieve top ranking on six benchmark datasets. Available codes: https://github.com/Royalvice/GDB.
引用
收藏
页码:680 / 684
页数:5
相关论文
共 50 条
  • [1] Multi-branch evolutionary generative adversarial networks based on covariance crossover operators
    Shang, Qingzhen
    Yang, Jinfu
    Ma, Jiaqi
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [2] Document Image Binarization Using Dual Discriminator Generative Adversarial Networks
    De, Rajonya
    Chakraborty, Anuran
    Sarkar, Ram
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 (27) : 1090 - 1094
  • [3] Document image binarization with cascaded generators of conditional generative adversarial networks
    Zhao, Jinyuan
    Shi, Cunzhao
    Jia, Fuxi
    Wang, Yanna
    Xiao, Baihua
    PATTERN RECOGNITION, 2019, 96
  • [4] Two-stage generative adversarial networks for binarization of color document images
    Suh, Sungho
    Kim, Jihun
    Lukowicz, Paul
    Lee, Yong Oh
    PATTERN RECOGNITION, 2022, 130
  • [5] MBA-RAINGAN: A MULTI-BRANCH ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR MIXTURE OF RAIN REMOVAL
    Shen, Yiyang
    Feng, Yidan
    Wang, Weiming
    Liang, Dong
    Qin, Jing
    Xie, Haoran
    Wei, Mingqiang
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 3418 - 3422
  • [6] Multi-branch neural networks with branch control
    Yamashita, T
    Hirasawa, K
    Hu, JL
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 756 - 761
  • [7] Multi-branch neural networks with Branch Control
    Yamashita, T
    Hirasawa, K
    Hu, JL
    SICE 2003 ANNUAL CONFERENCE, VOLS 1-3, 2003, : 2348 - 2353
  • [8] Heartbeat classification method combining multi-branch convolutional neural networks and transformer
    Zhou, Feiyan
    Wang, Jiannan
    ISCIENCE, 2024, 27 (03)
  • [9] Document Image Binarization with Fully Convolutional Neural Networks
    Tensmeyer, Chris
    Martinez, Tony
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 99 - 104
  • [10] Diverse sample generation with multi-branch conditional generative adversarial network for remote sensing objects detection
    Zhu, Dongjun
    Xia, Shixiong
    Zhao, Jiaqi
    Zhou, Yong
    Jian, Meng
    Niu, Qiang
    Yao, Rui
    Chen, Ying
    NEUROCOMPUTING, 2020, 381 : 40 - 51