New features using fractal multi-dimensions for generalized Arabic font recognition

被引:45
作者
Ben Moussa, Sami [1 ,2 ]
Zahour, Abderrazak [2 ]
Benabdelhafid, Abdellatif [2 ]
Alimi, Adel M. [1 ]
机构
[1] Natl Sch Engineers Sfax, REGIM, Sfax 3038, Tunisia
[2] Univ Havre, F-76063 Le Havre, France
关键词
Optical font recognition; Fractal features; Texture analysis; OCR; Arabic written; DIMENSION; CHARACTER;
D O I
10.1016/j.patrec.2009.10.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, a new method is proposed to the widely neglected problem of Arabic font recognition, it uses global texture analysis. This method is based on fractal geometry, and the feature extraction does not depend on the document contents. In Our method, we take the document as an image containing some specific textures and regard font recognition as texture identification. We have combined both techniques BCD (box counting dimension) and DCD (dilation Counting dimension) to obtain the main features. The first expresses texture distribution in 2-D image. The second makes possible to take on the human vision system aspect, since it makes it possible to differentiate one font from another. Both features are expressed in a parametric form: then four features were kept. Experiments are carried out by using 1000 samples of 10 typefaces (each typeface is combined with four sizes). The average recognition rates are of about 96.2% using KNN (K nearest neighbor) and 98% using RBF (radial basic function). Experimental results are also included in the robustness of the method against written size, skew, image degradation (e.g., Gaussian noise) and resolution, and compared with the existing methods. The main advantages of-our method are that (1) the dimension of feature vector is very low; (2) the variation sizes of the studied blocks (which are not standardized) are robust: (3) less samples are needed to train the classifier; (4) finally and the most important, is the first attempt to apply and adapt fractal dimensions to font recognition. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:361 / 371
页数:11
相关论文
共 40 条
[1]  
AJAY BK, 2001, PATTERN RECOGN, V22, P631
[2]   Evolutionary computation for the recognition of on-line cursive handwriting [J].
Alimi, AM .
IETE JOURNAL OF RESEARCH, 2002, 48 (05) :385-396
[3]  
ALIMI AM, 1995, ICDAR, V1, P382
[4]  
ALIMI AM, 1997, P IEEE INT C NEUR NE, V3, P1397
[5]   Off-line Arabic character recognition: The state of the art [J].
Amin, A .
PATTERN RECOGNITION, 1998, 31 (05) :517-530
[6]   Recognition of printed arabic text based on global features and decision tree learning techniques [J].
Amin, A .
PATTERN RECOGNITION, 2000, 33 (08) :1309-1323
[7]   High-order statistical texture analysis -: font recognition applied [J].
Avilés-Cruz, C ;
Rangel-Kuoppa, R ;
Reyes-Ayala, M ;
Andrade-Gonzalez, A ;
Escarela-Perez, R .
PATTERN RECOGNITION LETTERS, 2005, 26 (02) :135-145
[8]  
BALL G, 2006, P IWFHR 10 BAUL FRAN
[9]  
BELAID A, 2005, P INT WORKSH DOC AN
[10]  
Ben Moussa S., 2008, 19 INT C PATT REC IC