Accurate Fine-Grained Layout Analysis for the Historical Tibetan Document Based on the Instance Segmentation

Citations: 7
Authors
Zhao, Penghai [1]
Wang, Weilan [1]
Cai, Zhengqi [2]
Zhang, Guowei [1]
Lu, Yuqi [1]
Affiliations
[1] Northwest Minzu Univ, Key Lab Chinas Ethn Languages & Informat Technol, Minist Educ, Lanzhou 730000, Peoples R China
[2] Northwest Minzu Univ, Sch Math & Comp Sci, Lanzhou 730000, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Layout; Image segmentation; Text analysis; Annotations; Text recognition; Semantics; Character recognition; Document analysis and recognition; fine-grained layout analysis; historical Tibetan document images; layout analysis; text line segmentation; RECOGNITION;
DOI
10.1109/ACCESS.2021.3128536
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Accurate layout analysis without subsequent text-line segmentation remains an ongoing challenge, especially for the Kangyur, a type of historical Tibetan document with many touching components and a mottled background. Layout analysis aims to identify the different regions in a document image and is indispensable for subsequent procedures such as character recognition. However, little research has addressed line-level layout analysis, and the existing approaches fail to handle the Kangyur. To obtain better results, a fine-grained sub-line-level layout analysis approach is presented. First, we introduce an accelerated method for building a dynamic and reliable dataset. Second, we enhance SOLOv2 according to the characteristics of the Kangyur. We then train the enhanced SOLOv2 on the prepared annotation file. Once the network is trained, instances of text lines, sentences, and titles can be segmented and identified at inference time. Experimental results show that the proposed method achieves 72.7% average precision on our dataset. Overall, this preliminary research provides insights into fine-grained sub-line-level layout analysis and validates SOLOv2-based approaches. We also believe that the proposed method can be applied to documents in other languages with various layouts.
Pages: 154435-154447
Number of pages: 13