Handling Diacritical Marks for Online Arabic Script Based Languages Character Recognition using Fuzzy c-mean Clustering and Relative Position

被引:0
作者
Razzak, Muhammad Imran [1 ,3 ]
Husain, Syed Afaq [2 ]
Khan, Muhammad Khurram [3 ]
Sher, Muhammad
机构
[1] Int Islamic Univ, Dept Comp Sci, Islamabad, Pakistan
[2] Ripha Int Univ, Islamabad, Pakistan
[3] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh 11451, Saudi Arabia
来源
INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL | 2011年 / 14卷 / 01期
关键词
Diacritical Marks; Arabic; Urdu; Persian; Character Recognition; Relative Position; HANDWRITING RECOGNITION;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Arabic script is used by more than 1/4th population of the world in the form of different languages like Arabic, Persian, Urdu, Sindhi, Pashto etc whereas each language has its own words meaning, grammatical and writing rules and set of alphabets. Whereas on the other hand it is very difficult to deal with Arabic script based languages due its complex nature of this script. Moreover these language are rich in diacritical marks associated with each character and it is one of the main issue due to variability, position, shape and association with different character. We present a novel technique to handle diacritical marks based on the relative position of diacritical marks with respect to associated character and using mean of fuzzy c-mean which determined each cluster location using member defuzzification and stroke order. The diacritical marks concerned character is estimated by combining the position information and fuzzy mapping on to the character. Experiment shows that the proposed technique preformed well to deal with diacritical marks.
引用
收藏
页码:157 / 165
页数:9
相关论文
共 14 条