OCRSpell: An interactive spelling correction system for OCR errors in text

Cited by: 32
Authors
Taghva K. [1 ]
Stofsky E. [1 ]
Affiliation
[1] Information Science Research Institute, University of Nevada, Las Vegas, Las Vegas
Keywords
Error correction; Information retrieval; OCR-Spell checkers; Scanning
DOI
10.1007/PL00013558
Abstract
In this paper, we describe a spelling correction system designed specifically for OCR-generated text that selects candidate words through the use of information gathered from multiple knowledge sources. This system for text correction is based on static and dynamic device mappings, approximate string matching, and n-gram analysis. Our statistically based, Bayesian system incorporates a learning feature that collects confusion information at the collection and document levels. An evaluation of the new system is presented as well. © 2001 Springer-Verlag Berlin Heidelberg.
Pages: 125-137
Page count: 12