Handwriting segmentation of unconstrained Oriya text

Cited by: 29
Authors
Tripathy, N. [1]
Pal, U. [1]
Affiliation
[1] Indian Stat Inst, Comp Vis & Pattern Recognit Unit, Kolkata 700108, India
Source
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2006, Vol. 31, No. 6
Keywords
Indian language; Oriya script; character segmentation; handwriting recognition
DOI
10.1007/BF02716894
Chinese Library Classification number
T [Industrial Technology]
Subject classification code
08
Abstract
Segmentation of handwritten text into lines, words and characters is one of the important steps in the handwritten text recognition process. In this paper we propose a water reservoir concept-based scheme for segmenting unconstrained Oriya handwritten text into individual characters. The text image is first segmented into lines, and the lines are then segmented into individual words. For line segmentation, the document is divided into vertical stripes. The width of a stripe is calculated by analysing the heights of the water reservoirs obtained from different components of the document. Stripe-wise horizontal histograms are then computed, and the relationship between the peak and valley points of the histograms is used for line segmentation. Text lines are segmented into words based on vertical projection profiles and structural features of Oriya characters. For character segmentation, the isolated and connected (touching) characters in a word are first detected. Touching characters are then segmented using structural, topological and water reservoir concept-based features. In experiments, the proposed "touching character" segmentation module achieved 96.7% accuracy on two-character touching strings.
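The stripe-wise histogram step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract says the stripe width is derived from water reservoir heights and that peak-valley relationships drive the line cuts, whereas the sketch below simply takes the stripe width as a given parameter and treats the middle of each interior run of empty rows as a candidate valley. The function names `stripe_profiles` and `valley_rows`, the `threshold` parameter, and the 0/1 list-of-lists image representation are all assumptions made for illustration.

```python
def stripe_profiles(image, stripe_width):
    """Split a binary image (list of rows, 1 = ink) into vertical stripes
    and return one horizontal histogram (ink count per row) per stripe."""
    height = len(image)
    width = len(image[0]) if height else 0
    profiles = []
    for x0 in range(0, width, stripe_width):
        # Count ink pixels in each row, restricted to this stripe's columns.
        profile = [sum(row[x0:x0 + stripe_width]) for row in image]
        profiles.append(profile)
    return profiles


def valley_rows(profile, threshold=0):
    """Return the middle row of every run of low-ink rows that lies
    between two text bands; runs touching the top or bottom edge are
    treated as page margins and ignored."""
    valleys = []
    start = None
    for y, count in enumerate(profile):
        is_low = count <= threshold
        if is_low and start is None:
            start = y                      # a low-ink run begins
        elif not is_low and start is not None:
            if start > 0:                  # skip the top margin
                valleys.append((start + y - 1) // 2)
            start = None
    # A run still open here reaches the bottom edge: a margin, so skipped.
    return valleys


# Toy page: two text bands (rows 1-2 and rows 6-7) across 8 columns.
page = [[1] * 8 if y in (1, 2, 6, 7) else [0] * 8 for y in range(10)]
for profile in stripe_profiles(page, stripe_width=4):
    print(valley_rows(profile))
```

Working stripe by stripe, rather than on one histogram for the whole page, lets each stripe find its own valleys, which is why this kind of approach suits skewed or curved handwritten lines; joining the per-stripe valleys into full line boundaries is left out of this sketch.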
Pages: 755-769
Number of pages: 15