ARABIC CHARACTER-RECOGNITION USING FOURIER DESCRIPTORS AND CHARACTER CONTOUR ENCODING

被引:51
作者
MAHMOUD, SA
机构
[1] Computer Engineering Department, College of Computers and Information Sciences, King Saud University, Riyadh, 11543
关键词
ARABIC CHARACTER RECOGNITION; OCR; FOURIER DESCRIPTORS; CONTOUR ANALYSIS; CURVATURE FEATURES; DIRECTION FEATURES;
D O I
10.1016/0031-3203(94)90166-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Normalized Fourier descriptors are known to be invariant to scale, translation, and rotation. This technique was used by researchers of Latin OCR yielding acceptable results. In addition, contour analysis was used in object recognition with success. Both techniques are adopted as they are necessary for the recognition of Arabic characters with acceptable recognition rates. This combination was deemed necessary due to the special characteristics of Arabic characters that have some very similar characters. The character images are smoothed by a statistically-based algorithm to eliminate noise. Then, the contours of the image (namely the character primary part, the dots, and hole contours) are extracted. Fourier descriptors and curvature features of the primary part of the character are computed. These features of the training set are used as the model features. The features of an input character are compared to the models' features using a distance measure. The model with the minimum distance is taken as the class representing the character. The dots' and holes' features are then used to specify the particular character. Experimental results have shown that the combination of the Fourier descriptors, the curvature features and the use of dots' and holes' features to be powerful in successfully classifying Arabic characters. Recognition rates of 100% were achieved for the model classes. However, this rate has come down to 98% in the post-recognition phase of identifying the specific characters. The major part of these errors come from corrupted data.
引用
收藏
页码:815 / 824
页数:10
相关论文
共 25 条
[1]  
ABDELAZIM HY, 1989, P 11 NAT COMP C DHAH, P287
[2]  
ABDELAZIM HY, 1990, 12TH P NAT COMP C RI, P427
[3]   A METHOD OF RECOGNITION OF ARABIC CURSIVE HANDWRITING [J].
ALMUALLIM, H ;
YAMAGUCHI, S .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1987, 9 (05) :715-722
[4]  
ALYOUSEFI H, 1988, APPLICATIONS DIGITAL, V11, P330
[5]  
ALYOUSEFI HS, 1990, ARAB GULF J SCI RES, V8, P49
[6]   MACHINE RECOGNITION AND CORRECTION OF PRINTED ARABIC TEXT [J].
AMIN, A ;
MARI, JF .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05) :1300-1306
[7]   APPLICATION OF AFFINE-INVARIANT FOURIER DESCRIPTORS TO RECOGNITION OF 3-D OBJECTS [J].
ARBTER, K ;
SNYDER, WE ;
BURKHARDT, H ;
HIRZINGER, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (07) :640-647
[8]  
BORGHESI P, 1984, DIGITAL SIGNAL PROCE, V84
[9]   RECOGNITION OF PRINTED CHINESE CHARACTERS [J].
CASEY, R ;
NAGY, G .
IEEE TRANSACTIONS ON ELECTRONIC COMPUTERS, 1966, EC15 (01) :91-&
[10]  
ELDABI SS, 1990, PATTERN RECOGN, V23, P485, DOI 10.1016/0031-3203(90)90069-W