Accurate Fine-Grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation

被引:7
|
作者
Zhao, Penghai [1 ]
Wang, Weilan [1 ]
Cai, Zhengqi [2 ]
Zhang, Guowei [1 ]
Lu, Yuqi [1 ]
机构
[1] Northwest Minzu Univ, Key Lab Chinas Ethn Languages & Informat Technol, Minist Educ, Lanzhou 730000, Peoples R China
[2] Northwest Minzu Univ, Sch Math & Comp Sci, Lanzhou 730000, Peoples R China
基金
中国国家自然科学基金;
关键词
Layout; Image segmentation; Text analysis; Annotations; Text recognition; Semantics; Character recognition; Document analysis and recognition; fine-grained layout analysis; historical Tibetan document images; layout analysis; text line segmentation; RECOGNITION;
D O I
10.1109/ACCESS.2021.3128536
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Accurate layout analysis without subsequent text-line segmentation remains an ongoing challenge, especially when facing the Kangyur, a kind of historical Tibetan document featuring considerable touching components and mottled background. Aiming at identifying different regions in document images, layout analysis is indispensable for subsequent procedures such as character recognition. However, there was only a little research being carried out to perform line-level layout analysis which failed to deal with the Kangyur. To obtain the optimal results, a fine-grained sub-line level layout analysis approach is presented. Firstly, we introduced an accelerated method to build the dataset which is dynamic and reliable. Secondly, enhancement had been made to the SOLOv2 according to the characteristics of the Kangyur. Then, we fed the enhanced SOLOv2 with the prepared annotation file during the training phase. Once the network is trained, instances of the text line, sentence, and titles can be segmented and identified during the inference stage. The experimental results show that the proposed method delivers a decent 72.7% average precision on our dataset. In general, this preliminary research provides insights into the fine-grained sub-line level layout analysis and testifies the SOLOv2-based approaches. We also believe that the proposed methods can be adopted on other language documents with various layouts.
引用
收藏
页码:154435 / 154447
页数:13
相关论文
共 50 条
  • [1] Object Instance Segmentation and Fine-Grained Localization Using Hypercolumns
    Hariharan, Bharath
    Arbelaez, Pablo
    Girshick, Ross
    Malik, Jitendra
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (04) : 627 - 639
  • [2] Instance segmentation algorithm based on fine-grained feature perception and cross-path aggregation
    Ma, Jianxin
    Gu, Songbo
    Deng, Yangyang
    Ao, Tianyong
    KNOWLEDGE-BASED SYSTEMS, 2023, 276
  • [3] RefineMask: Towards High-Quality Instance Segmentation with Fine-Grained Features
    Zhang, Gang
    Lu, Xin
    Tan, Jingru
    Li, Jianmin
    Zhang, Zhaoxiang
    Li, Quanquan
    Hu, Xiaolin
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 6857 - 6865
  • [4] UrbanBIS: a Large-scale Benchmark for Fine-grained Urban Building Instance Segmentation
    Yang, Guoqing
    Xue, Fuyou
    Zhang, Qi
    Xie, Ke
    Fu, Chi-Wing
    Huang, Hui
    PROCEEDINGS OF SIGGRAPH 2023 CONFERENCE PAPERS, SIGGRAPH 2023, 2023,
  • [5] FINE-GRAINED BUILDING ROOF INSTANCE SEGMENTATION BASED ON DOMAIN ADAPTED PRETRAINING AND COMPOSITE DUAL-BACKBONE
    Liu, Guozhang
    Peng, Baochai
    Liu, Ting
    Zhang, Pan
    Yuan, Mengke
    Lu, Chaoran
    Cao, Ningning
    Zhang, Sen
    Huang, Simin
    Wang, Tao
    IGARSS 2023 - 2023 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2023, : 670 - 673
  • [6] One-Shot Fine-Grained Instance Retrieval
    Yao, Hantao
    Zhang, Shiliang
    Zhang, Yongdong
    Li, Jintao
    Tian, Qi
    PROCEEDINGS OF THE 2017 ACM MULTIMEDIA CONFERENCE (MM'17), 2017, : 342 - 350
  • [7] Instance Switching-Based Contrastive Learning for Fine-Grained Airplane Detection
    Zeng, Lanxin
    Guo, Haowen
    Yang, Wen
    Yu, Huai
    Yu, Lei
    Zhang, Peng
    Zou, Tongyuan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [8] Fine-Grained Instance-Level Sketch-Based Video Retrieval
    Xu, Peng
    Liu, Kun
    Xiang, Tao
    Hospedales, Timothy M.
    Ma, Zhanyu
    Guo, Jun
    Song, Yi-Zhe
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (05) : 1995 - 2007
  • [9] Fine-Grained Instance-Level Sketch-Based Image Retrieval
    Qian Yu
    Jifei Song
    Yi-Zhe Song
    Tao Xiang
    Timothy M. Hospedales
    International Journal of Computer Vision, 2021, 129 : 484 - 500
  • [10] Fine-Grained Instance-Level Sketch-Based Image Retrieval
    Yu, Qian
    Song, Jifei
    Song, Yi-Zhe
    Xiang, Tao
    Hospedales, Timothy M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 484 - 500