Combining Deep and Ad-hoc Solutions to Localize Text Lines in Ancient Arabic Document Images

被引:2
作者
Mechi, Olfa [1 ]
Mehri, Maroua [1 ]
Ingold, Rolf [2 ]
Ben Amara, Najoua Essoukri [1 ]
机构
[1] Univ Sousse, LATIS Lab Adv Technol & Intelligent Syst, Ecole Natl Ingn Sousse, Sousse 4023, Tunisia
[2] Univ Fribourg, DIVA Grp, CH-1700 Fribourg, Switzerland
来源
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2021年
关键词
Ancient Arabic document images; text line localization; deep architecture; ad-hoc post-processing; SEGMENTATION; RECOGNITION; COMPETITION; EXTRACTION;
D O I
10.1109/ICPR48806.2021.9412562
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text line localization in document images is still considered an open research task. The state-of-the-art methods in this regard that are only based on the classical image analysis techniques mostly have unsatisfactory performances especially when the document images i) contain significant degradations and different noise types and scanning defects, and ii) have touching and/or multi-skewed text lines or overlapping words/characters and non-uniform inter-line space. Moreover, localizing text in ancient handwritten Arabic document images is even more complex due to the morphological particularities related to the Arabic script. Thus, in this paper, we propose a hybrid method combining a deep network with classical document i mage analysis techniques for text line localization in ancient handwritten Arabic document images. The proposed method is firstly based on using the U-Net architecture to extract the main area covering the text core. Then, a modified RLSA combined with topological structural analysis are applied to localize whole text lines (including the ascender and descender components). To analyze the performance of the proposed method, a set of experiments has been conducted on many recent public and private datasets, and a thorough experimental evaluation has been carried out.
引用
收藏
页码:7759 / 7766
页数:8
相关论文
共 26 条
[1]  
Alberti Michele, 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR). Proceedings, P1200, DOI 10.1109/ICDAR.2019.00194
[2]  
[Anonymous], 2015, P IEEE C COMP VIS PA
[3]   Text Line Segmentation for Challenging Handwritten Document Images Using Fully Convolutional Network [J].
Barakat, Berat ;
Droby, Ahmad ;
Kassis, Majeed ;
El-Sana, Jihad .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :374-379
[4]   Gated Convolutional Recurrent Neural Networks for Multilingual Handwriting Recognition [J].
Bluche, Theodore ;
Messina, Ronaldo .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :646-651
[5]   ICFHR 2018 Competition on Recognition of Historical Arabic Scientific Manuscripts-RASM2018 [J].
Clausner, Christian ;
Antonacopoulos, Apostolos ;
McGregor, Nora ;
Wilson-Nunn, Daniel .
PROCEEDINGS 2018 16TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2018, :471-476
[6]   cBAD: ICDAR2017 Competition on Baseline Detection [J].
Diem, Markus ;
Kleber, Florian ;
Fiel, Stefan ;
Gatos, Basilis ;
Gruening, Tobias .
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, :1355-1360
[7]  
Galibert O, 2014, IEEE IMAGE PROC, P2594, DOI 10.1109/ICIP.2014.7025525
[8]   A two-stage method for text line detection in historical documents [J].
Gruening, Tobias ;
Leifert, Gundram ;
Strauss, Tobias ;
Michael, Johannes ;
Labahn, Roger .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2019, 22 (03) :285-302
[9]  
Hongxi Wei, 2019, Natural Language Processing and Chinese Computing. 8th CCF International Conference, NLPCC 2019. Proceedings. Lecture Notes in Artificial Intelligence, Subseries of Lecture Notes in Computer Science (LNAI 11839), P220, DOI 10.1007/978-3-030-32236-6_19
[10]  
Likforman-Sulem L., 1995, Proceedings of the Third International Conference on Document Analysis and Recognition, P774, DOI 10.1109/ICDAR.1995.602017