Integrating knowledge sources in Devanagari text recognition system

被引:45
作者
Bansal, V [1 ]
Sinha, RMK
机构
[1] Indian Inst Technol, Dept Indistrial & Management Engn, Kanpur 208016, Uttar Pradesh, India
[2] Indian Inst Technol, Dept Comp Sci & Engn, Kanpur 208016, Uttar Pradesh, India
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS | 2000年 / 30卷 / 04期
关键词
Devanagari document processing; knowledge-based systems; optical character recognition;
D O I
10.1109/3468.852443
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital role. This is the underlying philosophy of the Devanagari document recognition system described in this work. The knowledge sources we use are mostly statistical in nature or in the form of a word dictionary tailored specifically for optical character recognition (OCR). We do not perform any reasoning on these. However, we explore their relative importance and role in the hierarchy. Some of the knowledge sources are acquired a priori by an automated training process while others are extracted from the text as it is processed. A complete Devanagari OCR system has been designed and tested with real-life printed documents of varying size and font. Most of the documents used were photocopies of the original. A performance of approximately 90% correct recognition is achieved.
引用
收藏
页码:500 / 505
页数:6
相关论文
共 24 条
  • [1] BANSAL V, 1999, P INT C DOC AN REC I
  • [2] BANSAL V, 1996, P INT C INF SYST AN
  • [3] Bansal V., 1998, P IND C COMP VIS GRA
  • [4] BAOCHANG P, 1989, COMPUTER RECOGNITION, P37
  • [5] BAYER T, 1992, STRUCTURED DOCUMENT
  • [6] CASEY RG, 1990, IBM SYST J, V29
  • [7] OPTICAL CHARACTER-RECOGNITION BY THE METHOD OF MOMENTS
    CASH, GL
    HATAMIAN, M
    [J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1987, 39 (03): : 291 - 310
  • [8] A complete printed Bangla OCR system
    Chaudhuri, BB
    Pal, U
    [J]. PATTERN RECOGNITION, 1998, 31 (05) : 531 - 549
  • [9] Chaudhuri BB, 1997, PROC INT CONF DOC, P1011, DOI 10.1109/ICDAR.1997.620662
  • [10] HO TK, 1994, IEEE T PATTERN ANAL, V16, P66, DOI 10.1109/34.273716