A Page Object Detection Method Based on Mask R-CNN

被引:15
作者
Xu, Canhui [1 ,2 ]
Shi, Cao [1 ]
Bi, Hengyue [1 ]
Liu, Chuanqi [1 ]
Yuan, Yongfeng [3 ]
Guo, Haoyan [3 ]
Chen, Yinong [2 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Image segmentation; Convolutional neural networks; Layout; Semantics; Object recognition; Page object detection; document images; deep learning; convolutional neural networks; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3121152
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Page object detection is crucial for document understanding. Different granularities for objects can result in different performances. In this study, block level region object detection is considered among the inherent hierarchical structure for document images. Inspired by Mask R-CNN (Region-based Convolutional Neural Networks) method, an end to end network is proposed to perform object classification, bounding box identification, and page object mask generation at the same time. Latex based synthetic document generation is designed for enlarging the training data. A large number of synthetic page images are generated for training to alleviate the insufficient dataset problem. Compared with existing page object competition methods, the proposed method achieves better results, with mAP of 0.917 on page objects such as table, figure and maths detection.
引用
收藏
页码:143448 / 143457
页数:10
相关论文
共 50 条
  • [31] Image Object Detection Method Based on Improved Faster R-CNN
    Yin, Xiuye
    Chen, Liyong
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2024, 33 (07)
  • [32] A vehicle detection and tracking method for traffic video based on faster R-CNN
    Mohamed Othmani
    Multimedia Tools and Applications, 2022, 81 : 28347 - 28365
  • [33] Landslide Extraction Using Mask R-CNN with Background-Enhancement Method
    Yang, Ruilin
    Zhang, Feng
    Xia, Junshi
    Wu, Chuyi
    REMOTE SENSING, 2022, 14 (09)
  • [34] Remote sensing image building detection method based on Mask R-CNN
    Qinzhe Han
    Qian Yin
    Xin Zheng
    Ziyi Chen
    Complex & Intelligent Systems, 2022, 8 : 1847 - 1855
  • [35] Remote sensing image building detection method based on Mask R-CNN
    Han, Qinzhe
    Yin, Qian
    Zheng, Xin
    Chen, Ziyi
    COMPLEX & INTELLIGENT SYSTEMS, 2022, 8 (03) : 1847 - 1855
  • [36] Netting Damage Detection for Marine Aquaculture Facilities Based on Improved Mask R-CNN
    Zhang, Ziliang
    Gui, Fukun
    Qu, Xiaoyu
    Feng, Dejun
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (07)
  • [37] Steel Roll Eye Pose Detection Based on Binocular Vision and Mask R-CNN
    Su, Xuwu
    Wang, Jie
    Wang, Yifan
    Zhang, Daode
    SENSORS, 2025, 25 (06)
  • [38] An object detection method for catenary component images based on improved Faster R-CNN
    Wu, Changdong
    He, Xu
    Wu, Yanliang
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (08)
  • [39] MACD R-CNN: An Abnormal Cell Nucleus Detection Method
    Ma, Baoyan
    Zhang, Jian
    Cao, Feng
    He, Yongjun
    IEEE ACCESS, 2020, 8 (08): : 166658 - 166669
  • [40] A New Mask R-CNN-Based Method for Improved Landslide Detection
    Ullo, Silvia Liberata
    Mohan, Amrita
    Sebastianelli, Alessandro
    Ahamed, Shaik Ejaz
    Kumar, Basant
    Dwivedi, Ramji
    Sinha, Ganesh
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, 14 : 3799 - 3810