A Page Object Detection Method Based on Mask R-CNN

被引:15
|
作者
Xu, Canhui [1 ,2 ]
Shi, Cao [1 ]
Bi, Hengyue [1 ]
Liu, Chuanqi [1 ]
Yuan, Yongfeng [3 ]
Guo, Haoyan [3 ]
Chen, Yinong [2 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Image segmentation; Convolutional neural networks; Layout; Semantics; Object recognition; Page object detection; document images; deep learning; convolutional neural networks; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3121152
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Page object detection is crucial for document understanding. Different granularities for objects can result in different performances. In this study, block level region object detection is considered among the inherent hierarchical structure for document images. Inspired by Mask R-CNN (Region-based Convolutional Neural Networks) method, an end to end network is proposed to perform object classification, bounding box identification, and page object mask generation at the same time. Latex based synthetic document generation is designed for enlarging the training data. A large number of synthetic page images are generated for training to alleviate the insufficient dataset problem. Compared with existing page object competition methods, the proposed method achieves better results, with mAP of 0.917 on page objects such as table, figure and maths detection.
引用
收藏
页码:143448 / 143457
页数:10
相关论文
共 50 条
  • [1] Object detection based on RGC mask R-CNN
    Wu, Minghu
    Yue, Hanhui
    Wang, Juan
    Huang, Yongxi
    Liu, Min
    Jiang, Yuhan
    Ke, Cong
    Zeng, Cheng
    IET IMAGE PROCESSING, 2020, 14 (08) : 1502 - 1508
  • [2] R-CNN Based Vehicle Object Detection via Segmentation Capabilities in Road Scenes
    Riaz Chughtai, Bisma
    Alhasson, Haifa F.
    Alnusayri, Mohammed
    Alatiyyah, Mohammed
    Aljuaid, Hanan
    Jalal, Ahmad
    Park, Jeongmin
    IEEE ACCESS, 2025, 13 : 3355 - 3370
  • [3] Ganster R-CNN: Occluded Object Detection Network Based on Generative Adversarial Nets and Faster R-CNN
    Sun, Kelei
    Wen, Qiufen
    Zhou, Huaping
    IEEE ACCESS, 2022, 10 : 105022 - 105030
  • [4] Mask R-CNN
    He, Kaiming
    Gkioxari, Georgia
    Dollar, Piotr
    Girshick, Ross
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (02) : 386 - 397
  • [5] Mask R-CNN Object Detection Method Based on Improved Feature Pyramid
    Ren Zhijun
    Lin Suzhen
    Li Dawei
    Wang Lifang
    Zuo Jianhong
    LASER & OPTOELECTRONICS PROGRESS, 2019, 56 (04)
  • [6] Detection Method of Corn Weed Based on Mask R-CNN
    Jiang H.
    Zhang C.
    Zhang Z.
    Mao W.
    Wang D.
    Wang D.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2020, 51 (06): : 220 - 228and247
  • [7] Deep Orange: Mask R-CNN based Orange Detection and Segmentation
    Ganesh, P.
    Volle, K.
    Burks, T. F.
    Mehta, S. S.
    IFAC PAPERSONLINE, 2019, 52 (30): : 70 - 75
  • [8] Pedestrian Detection and Segmentation Method Based on Mask R-CNN
    Chen, Jia-jun
    Qing, Xiao-qu
    Yu, Hua-peng
    Chang, Yong-xin
    2018 INTERNATIONAL CONFERENCE ON ELECTRICAL, CONTROL, AUTOMATION AND ROBOTICS (ECAR 2018), 2018, 307 : 459 - 463
  • [9] A Dense Attention Railway Foreign Object Detection Algorithm Based on Mask R-CNN
    Gao, Shuang
    IEEE ACCESS, 2024, 12 : 85761 - 85772
  • [10] Solder Joint Recognition Using Mask R-CNN Method
    Wu, Hao
    Gao, Wenbin
    Xu, Xiangrong
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2020, 10 (03): : 525 - 530