A Page Object Detection Method Based on Mask R-CNN

被引:15
|
作者
Xu, Canhui [1 ,2 ]
Shi, Cao [1 ]
Bi, Hengyue [1 ]
Liu, Chuanqi [1 ]
Yuan, Yongfeng [3 ]
Guo, Haoyan [3 ]
Chen, Yinong [2 ]
机构
[1] Qingdao Univ Sci & Technol, Sch Informat Sci & Technol, Qingdao 266061, Peoples R China
[2] Arizona State Univ, Sch Comp Informat & Decis Syst Engn, Tempe, AZ 85287 USA
[3] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Object detection; Image segmentation; Convolutional neural networks; Layout; Semantics; Object recognition; Page object detection; document images; deep learning; convolutional neural networks; CLASSIFICATION;
D O I
10.1109/ACCESS.2021.3121152
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Page object detection is crucial for document understanding. Different granularities for objects can result in different performances. In this study, block level region object detection is considered among the inherent hierarchical structure for document images. Inspired by Mask R-CNN (Region-based Convolutional Neural Networks) method, an end to end network is proposed to perform object classification, bounding box identification, and page object mask generation at the same time. Latex based synthetic document generation is designed for enlarging the training data. A large number of synthetic page images are generated for training to alleviate the insufficient dataset problem. Compared with existing page object competition methods, the proposed method achieves better results, with mAP of 0.917 on page objects such as table, figure and maths detection.
引用
收藏
页码:143448 / 143457
页数:10
相关论文
共 50 条
  • [21] Gas mask wearing detection based on Faster R-CNN
    Wang, Bangrong
    Wang, Jun
    Xu, Xiaofeng
    Bao, Xianglin
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2023, 16 (01) : 57 - 71
  • [22] Multi-Class Object Detection from Aerial Images Using Mask R-CNN
    Schweitzer, David
    Agrawal, Rajeev
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 3470 - 3477
  • [23] Application of generated mask method based on Mask R-CNN in classification and detection of melanoma
    Cao, Xingmei
    Pan, Jeng-Shyang
    Wang, Zhengdi
    Sun, Zhonghai
    ul Haq, Anwar
    Deng, Wenyu
    Yang, Shuangyuan
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 207
  • [24] Pedestrian Detection Using R-CNN Object Detector
    Masita, Katleho L.
    Hasan, Ali N.
    Paul, Satyakama
    2018 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2018,
  • [25] The Pest and Disease Identification in the Growth of Sweet Peppers Using Faster R-CNN and Mask R-CNN
    Lin, Tu-Liang
    Chang, Hong-Yi
    Chen, Kai-Hong
    JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (02): : 605 - 614
  • [26] Detection of Parking Slots Based on Mask R-CNN
    Jiang, Shaokang
    Jiang, Haobin
    Ma, Shidian
    Jiang, Zhongxu
    APPLIED SCIENCES-BASEL, 2020, 10 (12):
  • [27] Potato Detection and Segmentation Based on Mask R-CNN
    Lee H.-S.
    Shin B.-S.
    Journal of Biosystems Engineering, 2020, 45 (4) : 233 - 238
  • [28] Human Detection Based on Improved Mask R-CNN
    Wang, Yuejuan
    Wu, Ji
    Li, Heting
    5TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2020), 2020, 1575
  • [29] INSHORE SHIP DETECTION BASED ON MASK R-CNN
    Nie, Shanlan
    Jiang, Zhiguo
    Zhang, Haopeng
    Cai, Bowen
    Yao, Yuan
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 693 - 696
  • [30] A vehicle detection and tracking method for traffic video based on faster R-CNN
    Othmani, Mohamed
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (20) : 28347 - 28365