Historical Text Line Segmentation Using Deep Learning Algorithms: Mask-RCNN against U-Net Networks

被引:5
作者
Fizaine, Florian Come [1 ,2 ]
Bard, Patrick [1 ]
Paindavoine, Michel [1 ]
Robin, Cecile [2 ,3 ]
Bouye, Edouard [2 ]
Lefevre, Raphael [4 ]
Vinter, Annie [1 ]
机构
[1] Univ Bourgogne, LEAD CNRS, F-21000 Dijon, France
[2] Arch Dept Cote dOr, F-21000 Dijon, France
[3] Inst Natl Patrimoine, F-75002 Paris, France
[4] Soc Natl Chemins Fer Francais, F-93200 St Denis, France
关键词
deep learning; line segmentation; instance segmentation; Mask-RCNN; U-Net; historical document analysis; DOCUMENTS;
D O I
10.3390/jimaging10030065
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Text line segmentation is a necessary preliminary step before most text transcription algorithms are applied. The leading deep learning networks used in this context (ARU-Net, dhSegment, and Doc-UFCN) are based on the U-Net architecture. They are efficient, but fall under the same concept, requiring a post-processing step to perform instance (e.g., text line) segmentation. In the present work, we test the advantages of Mask-RCNN, which is designed to perform instance segmentation directly. This work is the first to directly compare Mask-RCNN- and U-Net-based networks on text segmentation of historical documents, showing the superiority of the former over the latter. Three studies were conducted, one comparing these networks on different historical databases, another comparing Mask-RCNN with Doc-UFCN on a private historical database, and a third comparing the handwritten text recognition (HTR) performance of the tested networks. The results showed that Mask-RCNN outperformed ARU-Net, dhSegment, and Doc-UFCN using relevant line segmentation metrics, that performance evaluation should not focus on the raw masks generated by the networks, that a light mask processing is an efficient and simple solution to improve evaluation, and that Mask-RCNN leads to better HTR performance.
引用
收藏
页数:18
相关论文
共 50 条
[31]   Deep Learning-Based Retinal Blood Vessel Segmentation Using U-Net Architecture [J].
Boazu, Ligia-Gabriela ;
Petraru, Marian-Alexandru ;
Zvoristeanu, Otilia .
2024 12TH E-HEALTH AND BIOENGINEERING CONFERENCE, EHB 2024, 2024, :269-272
[32]   Texture Segmentation: An Objective Comparison between Five Traditional Algorithms and a Deep-Learning U-Net Architecture [J].
Karabag, Cefa ;
Verhoeven, Jo ;
Miller, Naomi Rachel ;
Reyes-Aldasoro, Constantino Carlos .
APPLIED SCIENCES-BASEL, 2019, 9 (18)
[33]   Edge U-Net: Brain tumor segmentation using MRI based on deep U-Net model with boundary information [J].
Allah, Ahmed M. Gab ;
Sarhan, Amany M. ;
Elshennawy, Nada M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
[34]   Cascaded 3-Stage Nuclei Segmentation using U-net, Faster-RCNN and SegNet for Higher Precision [J].
Shihavuddin, A. S. M. ;
Kiron, Mohammad Kamrozzaman ;
Islam, Md Imamul ;
Maruf, Md Hasan ;
Ashique, Ratil H. ;
Kabir, Shahriar Mahmud .
2021 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE TECHNOLOGIES FOR INDUSTRY 4.0 (STI), 2021,
[35]   Adrenal Tumor Segmentation on U-Net: A Study About Effect of Different Parameters in Deep Learning [J].
Solak, Ahmet ;
Ceylan, Rahime ;
Bozkurt, Mustafa Alper ;
Cebeci, Hakan ;
Koplay, Mustafa .
VIETNAM JOURNAL OF COMPUTER SCIENCE, 2024, 11 (01) :111-135
[36]   Development of Deep Learning with RDA U-Net Network for Bladder Cancer Segmentation [J].
Lee, Ming-Chan ;
Wang, Shao-Yu ;
Pan, Cheng-Tang ;
Chien, Ming-Yi ;
Li, Wei-Ming ;
Xu, Jin-Hao ;
Luo, Chi-Hung ;
Shiue, Yow-Ling .
CANCERS, 2023, 15 (04)
[37]   Automatic segmentation of rectal tumor on diffusion-weighted images by deep learning with U-Net [J].
Zhu, Hai-Tao ;
Zhang, Xiao-Yan ;
Shi, Yan-Jie ;
Li, Xiao-Ting ;
Sun, Ying-Shi .
JOURNAL OF APPLIED CLINICAL MEDICAL PHYSICS, 2021, 22 (09) :324-331
[38]   Brain Tumour Segmentation Using U-net Based Adversarial Networks [J].
Teki, Satyanarayana Murthy ;
Varma, Mohan Krishna ;
Yadav, Anjana K. .
TRAITEMENT DU SIGNAL, 2019, 36 (04) :353-359
[39]   A U-Net Deep Learning Framework for High Performance Vessel Segmentation in Patients With Cerebrovascular Disease [J].
Livne, Michelle ;
Rieger, Jana ;
Aydin, Orhun Utku ;
Taha, Abdel Aziz ;
Akay, Ela Marie ;
Kossen, Tabea ;
Sobesky, Jan ;
Kelleher, John D. ;
Hildebrand, Kristian ;
Frey, Dietmar ;
Madai, Vince, I .
FRONTIERS IN NEUROSCIENCE, 2019, 13
[40]   Overlapping Chromosome Segmentation using U-Net: Convolutional Networks with Test Time Augmentation [J].
Saleh, Hariyanti Mohd ;
Saad, Nor Hidayah ;
Isa, Nor Ashidi Mat .
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES 2019), 2019, 159 :524-533