Script Separation in Machine Printed Bilingual (Devnagari and Gurumukhi) Documents Using Morphological Approach

Times cited: 0
Authors
Singh, Sukhvir [1]
Kumar, Anil [1]
Shaw, Dinesh Kr. [1]
Ghosh, D. [1]
Affiliations
[1] Indian Inst Technol Roorkee, Dept Elect & Commun Engn, Roorkee 247667, Uttar Pradesh, India
Source
2014 TWENTIETH NATIONAL CONFERENCE ON COMMUNICATIONS (NCC) | 2014
Keywords
Document analysis; script identification; bilingual documents; feature extraction; morphological operations; RECOGNITION; IDENTIFICATION; IMAGES; SYSTEM;
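DOI
Not available
Chinese Library Classification number
TN [Electronic technology, communication technology];
Discipline classification code
0809;
Abstract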
In this paper, a bilingual script recognition system is developed to identify and separate out texts written in Devnagari and Gurumukhi scripts. It is observed that vertical half strokes and horizontal strokes are predominantly used in Gurumukhi script compared to Devnagari. On the basis of this observation, we develop a morphological image processing based scheme to extract vertical and horizontal strokes in a document image. The two script regions are subsequently separated out on the basis of the density of these strokes in different regions of the document. Experimental results demonstrate the effectiveness of our proposed scheme. However, the proposed scheme works well only with machine printed documents.
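The stroke-extraction idea in the abstract can be illustrated with a minimal sketch. The abstract does not give the structuring elements, their lengths, or the decision rule, so everything below is an assumption made for illustration: strokes are extracted by binary opening with a straight line element, the function names (open_rows, horizontal_strokes, vertical_strokes, stroke_density) and the length parameter k are hypothetical, and the image is a plain list of 0/1 rows rather than a scanned document.

```python
def open_rows(img, k):
    """Binary opening of every row with a 1 x k horizontal line element.

    Assumed operator: the paper's actual structuring elements are not
    stated in the abstract.
    """
    out = []
    for row in img:
        n = len(row)
        # Erosion: position c survives only if row[c:c+k] is all foreground.
        eroded = [1 if c + k <= n and all(row[c:c + k]) else 0 for c in range(n)]
        # Dilation with the reflected element: each survivor restores the
        # k pixels it covered, so only runs of length >= k remain.
        opened = [1 if any(eroded[max(0, c - k + 1):c + 1]) else 0 for c in range(n)]
        out.append(opened)
    return out


def transpose(img):
    """Swap rows and columns of a rectangular 0/1 image."""
    return [list(col) for col in zip(*img)]


def horizontal_strokes(img, k):
    """Keep only horizontal runs of at least k foreground pixels."""
    return open_rows(img, k)


def vertical_strokes(img, k):
    """Keep only vertical runs of at least k foreground pixels."""
    return transpose(open_rows(transpose(img), k))


def stroke_density(mask):
    """Fraction of pixels in a region that belong to extracted strokes."""
    total = sum(len(row) for row in mask)
    return sum(map(sum, mask)) / total if total else 0.0
```

A region's horizontal and vertical stroke densities could then be compared between regions to decide which script each one holds; the comparison rule and any threshold would have to come from the paper itself and are not guessed here.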
Pages: 5