Binarization, character extraction, and writer identification of historical Hebrew calligraphy documents

被引:44
作者
Bar-Yosef, Itay [1 ]
Beckman, Isaac
Kedem, Klara
Dinstein, Itshak
机构
[1] Ben Gurion Univ Negev, Dept Comp Sci, IL-84105 Beer Sheva, Israel
[2] Ben Gurion Univ Negev, Dept Elect & Comp Engn, IL-84105 Beer Sheva, Israel
关键词
binarization; character extraction; writer identification; document analysis; historical documents;
D O I
10.1007/s10032-007-0041-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present our work on the paleographic analysis and recognition system intended for processing of historical Hebrew calligraphy documents. The main goal is to analyze documents of different writing styles in order to identify the locations, dates, and writers of test documents. Using interactive software tools, a data base of extracted characters has been established. It now contains about 20,000 characters of 34 different writers, and will be distinctly expanded in the near future. Preliminary results of automatic extraction of pre-specified letters using the erosion operator are presented. We further propose and test topological features for handwriting style classification based on a selected subset of the Hebrew alphabet. A writer identification experiment using 34 writers yielded 100% correct classification.
引用
收藏
页码:89 / 99
页数:11
相关论文
共 23 条
[1]  
Ablavsky V, 2003, PROC INT CONF DOC, P750
[2]   A segmentation-free approach to text recognition with application to Arabic text [J].
Al-Badr B. ;
Haralick R.M. .
International Journal on Document Analysis and Recognition, 1998, 1 (3) :147-166
[3]   Input sensitive thresholding for ancient Hebrew manuscript [J].
Bar-Yosef, I .
PATTERN RECOGNITION LETTERS, 2005, 26 (08) :1168-1173
[4]  
BEITARIE M, 1991, PALEOGRAPHICAL IDENT, P15
[5]   LINEAR-TIME EUCLIDEAN DISTANCE TRANSFORM ALGORITHMS [J].
BREU, H ;
GIL, J ;
KIRKPATRICK, D ;
WERMAN, M .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (05) :529-533
[6]  
Bulacu M, 2003, PROC INT CONF DOC, P937
[7]  
DINSTEIN I, 1982, IEEE T SYST MAN CYB, V12, P405
[8]  
Duda R. O., 1973, Pattern Classification
[9]  
FOURNIER JM, 1971, ISRAEL J TECHNOL, V9, P281
[10]   IMAGE-ANALYSIS USING MATHEMATICAL MORPHOLOGY [J].
HARALICK, RM ;
STERNBERG, SR ;
ZHUANG, XH .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (04) :532-550