Direct Unsupervised Text Line Extraction from Colored Historical Manuscript Images Using DCT

被引:0
作者
Baig, Asim [1 ]
Al-Maadeed, Somaya [1 ]
Bouridane, Ahmed [2 ]
Cheriet, Mohamed [3 ]
机构
[1] Qatar Univ, Doha, Qatar
[2] Northumbria Univ, Newcastle Upon Tyne, Tyne & Wear, England
[3] Ecole Technol Super, Montreal, PQ, Canada
来源
IMAGE ANALYSIS AND RECOGNITION (ICIAR 2016) | 2016年 / 9730卷
关键词
Text line extraction; Segmentation; DCT; Historical manuscripts; Color image processing;
D O I
10.1007/978-3-319-41501-7_84
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting lines of text from a manuscript is an important preprocessing step in many digital paleography applications. These extracted lines play a fundamental part in the identification of the author and/or age of the manuscript. In this paper we present an unsupervised approach to text line extraction in historical manuscripts that can be applied directly to a color manuscript image. Each of the red, green and blue channels are processed separately by applying DCT on them individually. One of the key advantages of this approach is that it can be applied directly to the manuscript image without any preprocessing, training or tuning steps. Extensive testing on complex Arabic handwritten manuscripts shows the effectiveness of the proposed approach.
引用
收藏
页码:753 / 762
页数:10
相关论文
共 21 条
[1]   DISCRETE COSINE TRANSFORM [J].
AHMED, N ;
NATARAJAN, T ;
RAO, KR .
IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) :90-93
[2]  
Alaql O, 2014, 2014 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), VOLS 1-2, P312, DOI 10.1109/ICALIP.2014.7009807
[3]   ICDAR2013 Competition on Historical Newspaper Layout Analysis-HNLA2013 [J].
Antonacopoulos, A. ;
Clausner, C. ;
Papadopoulos, C. ;
Pletschacher, S. .
2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, :1454-1458
[4]   Historical Document Layout Analysis Competition [J].
Antonacopoulos, A. ;
Clausner, C. ;
Papadopoulos, C. ;
Pletschacher, S. .
11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, :1516-1520
[5]   Seam Carving for Text Line Extraction on Color and Grayscale Historical Manuscripts [J].
Arvanitopoulos, Nikolaos ;
Suesstrunk, Sabine .
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, :726-731
[6]  
Bulacu M, 2007, PROC INT CONF DOC, P357
[7]   Writer Identification on Historical Glagolitic Documents [J].
Fiel, Stefan ;
Hollaus, Fabian ;
Gau, Melanie ;
Sablatnig, Robert .
DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
[8]  
Fischer A., 2010, P 9 IAPR INT WORKSH, P3, DOI [10.1145/1815330.1815331, DOI 10.1145/1815330.1815331]
[9]   Automatic Transcription of Handwritten Medieval Documents [J].
Fischer, Andreas ;
Wuethrich, Markus ;
Liwicki, Marcus ;
Frinken, Volkmar ;
Bunke, Horst ;
Viehhauser, Gabriel ;
Stolz, Michael .
2009 15TH INTERNATIONAL CONFERENCE ON VIRTUAL SYSTEMS AND MULTIMEDIA PROCEEDINGS (VSMM 2009), 2009, :137-+
[10]   A TWO-DIMENSIONAL FAST COSINE TRANSFORM [J].
HAQUE, MA .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (06) :1532-1539