Segmentation of Arabic Text into Characters for Recognition

被引:0
|
作者
Shaikh, Noor Ahmed [1 ]
Shaikh, Zubair Ahmed [2 ]
Ali, Ghulam [1 ]
机构
[1] Shah A Latif Univ, Khairpur, Pakistan
[2] FAST NU, Karachi, Pakistan
来源
WIRELESS NETWORKS, INFORMATION PROCESSING AND SYSTEMS | 2008年 / 20卷
关键词
Arabic Text Segmentation; Sindhi OCR;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
One of the steps of character recognition systems is the segmentation of words/sub-words into characters. The segmentation of text written in any Arabic script is a most difficult task. Due to this difficulty, many systems consider sub-words instead of a character as the basic unit for recognition. We propose a method for the segmentation of printed Arabic words/sub-words into characters. In the proposed method, primary and secondary strokes of the sub-words are separated and then segmentation points are identified in the primary strokes. For this, we compute the vertical projection graph for each line, which is then processed to generate a string indicating relative variations in pixels. The string is scanned further to produce characters from the sub-words. In the proposed method we use Sindhi text for segmentation into characters as its character set is the super set of Arabic. This method can be used for any other Naskh-based Arabic script such as Persian, Pashto and Urdu.
引用
收藏
页码:11 / +
页数:2
相关论文
共 50 条
  • [1] Segmentation and recognition of Arabic characters by structural classification
    Bushofa, BMF
    Spann, M
    IMAGE AND VISION COMPUTING, 1997, 15 (03) : 167 - 179
  • [2] RECOGNITION OF HANDWRITTEN ARABIC CHARACTERS VIA SEGMENTATION
    ALYOUSEFI, HS
    UDPA, SS
    ARAB GULF JOURNAL OF SCIENTIFIC RESEARCH, 1990, 8 (02): : 49 - 59
  • [3] Segmentation and recognition of Arabic characters by structural classification
    Univ of Birmingham, Birmingham, United Kingdom
    Image Vision Comput, 3 (167-179):
  • [4] A segmentation-free approach to text recognition with application to Arabic text
    Al-Badr B.
    Haralick R.M.
    International Journal on Document Analysis and Recognition, 1998, 1 (3) : 147 - 166
  • [5] A segmentation-free approach to text recognition with application to Arabic text
    Department of Computer Science and Engineering, University of Washington, Mail Stop FR-35, Seattle, WA 98195, United States
    Int. J. Doc. Anal. Recogn., 3 (147-166):
  • [6] An Efficient Segmentation Algorithm for Arabic Handwritten Characters Recognition System
    Fadeel, Mohamed A.
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MATHEMATICS AND COMPUTERS IN SCIENCES AND IN INDUSTRY (MCSI 2016), 2016, : 172 - 177
  • [7] An Efficient Segmentation Algorithm for Arabic Handwritten Characters Recognition System
    Ali, Mohamed A.
    AFRO-EUROPEAN CONFERENCE FOR INDUSTRIAL ADVANCEMENT, AECIA 2014, 2015, 334 : 193 - 204
  • [8] RECOGNITION OF ARABIC CHARACTERS
    ALYOUSEFI, H
    UDPA, SS
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (08) : 853 - 857
  • [9] Recognition Based Segmentation of Connected Characters in Text Based CAPTCHAs
    Hussain, Rafaqat
    Gao, Hui
    Shaikh, Riaz Ahmed
    Soomro, Shazia Parveen
    PROCEEDINGS OF 2016 8TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2016), 2016, : 673 - 676
  • [10] Improved linear density technique for segmentation in Arabic handwritten text recognition
    Husam Ahmed Al Hamad
    Laith Abualigah
    Mohammad Shehab
    Khalil H. A. Al-Shqeerat
    Mohammad Otair
    Multimedia Tools and Applications, 2022, 81 : 28531 - 28558