Baseline estimation for Arabic handwritten words

Cited by: 48
Authors
Pechwitz, M [1]
Märgner, V [1]
Affiliations
[1] Tech Univ Braunschweig, Inst Commun Technol, D-38092 Braunschweig, Germany
Source
EIGHTH INTERNATIONAL WORKSHOP ON FRONTIERS IN HANDWRITING RECOGNITION: PROCEEDINGS | 2002
DOI
10.1109/IWFHR.2002.1030956
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Baseline information has been used for diverse purposes in handwriting research. The baseline provides a first orientation within a word and is often a precondition for subsequent algorithms, including preprocessing, segmentation, and feature extraction for recognition systems. Approaches based on the horizontal projection histogram are used for printed Arabic text, but they are ill-suited to handwritten Arabic words. In this paper we present a method based entirely on processing the polygonally approximated skeleton. The central algorithm is concerned with finding features in the skeleton and applying linear regression analysis. Our method performs very well as long as the model assumption of a single straight baseline holds. We tested the method on 26459 isolated Tunisian town names written by 411 writers (IFN/ENIT-database).
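The two ideas contrasted in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `projection_baseline` and `fit_baseline` are hypothetical, and the abstract does not say how baseline-relevant skeleton features are selected, so the regression step simply takes a list of (x, y) points as given.

```python
# Illustrative sketch only; neither function is taken from the paper.

def projection_baseline(image):
    """Return the row index with the most ink pixels.

    This is the peak of the horizontal projection histogram, the kind of
    estimate the abstract says works for printed text.
    `image` is a list of rows, each a list of 0/1 values (1 = ink).
    """
    row_sums = [sum(row) for row in image]
    return max(range(len(row_sums)), key=row_sums.__getitem__)


def fit_baseline(points):
    """Fit y = slope * x + intercept to (x, y) points by least squares.

    `points` stands in for baseline-relevant skeleton features; how they
    are chosen is an assumption left outside this sketch.
    """
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("points must not all share one x coordinate")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept
```

A fitted line can follow a slanted word, whereas the histogram peak always yields a single horizontal row, which is one plausible reading of why projection-based estimates suit printed text better than handwriting.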
Pages: 479-484
Number of pages: 4