OCRSpell: An interactive spelling correction system for OCR errors in text

Cited by: 32
Authors
Taghva K. [1]
Stofsky E. [1]
Affiliation
[1] Information Science Research Institute, University of Nevada, Las Vegas, Las Vegas
Keywords
Error correction; Information retrieval; OCR-Spell checkers; Scanning
DOI
10.1007/PL00013558
Abstract
In this paper, we describe a spelling correction system designed specifically for OCR-generated text that selects candidate words through the use of information gathered from multiple knowledge sources. This system for text correction is based on static and dynamic device mappings, approximate string matching, and n-gram analysis. Our statistically based, Bayesian system incorporates a learning feature that collects confusion information at the collection and document levels. An evaluation of the new system is presented as well. © 2001 Springer-Verlag Berlin Heidelberg.
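The abstract names device mappings and a Bayesian ranking of candidate words. The following is a minimal sketch of that general idea, not the OCRSpell implementation: it applies one confusion mapping at a time to a misrecognized token and scores each resulting lexicon word by a channel probability times a word prior. The mapping table, its probabilities, the toy lexicon, and the function names `candidates`, `rank`, and `learn` are all assumptions made for illustration; approximate string matching and n-gram analysis are left out.

```python
from collections import Counter

# Hypothetical device mappings: OCR output substring -> [(intended substring, P(confusion))].
# The entries and probabilities are illustrative, not taken from the paper.
DEVICE_MAP = {
    "rn": [("m", 0.20)],
    "1": [("l", 0.30)],
    "0": [("o", 0.25)],
}

# Toy lexicon with made-up frequencies, standing in for a word prior.
LEXICON = Counter({"modern": 50, "model": 40, "old": 30, "would": 20, "cold": 5})


def candidates(token):
    """Yield (candidate, channel_prob) pairs by undoing one mapping at one position."""
    for i in range(len(token)):
        for wrong, fixes in DEVICE_MAP.items():
            if token.startswith(wrong, i):
                for right, prob in fixes:
                    yield token[:i] + right + token[i + len(wrong):], prob


def rank(token):
    """Rank lexicon words by P(confusion) * P(word), highest score first."""
    total = sum(LEXICON.values())
    scored = {}
    for cand, p_channel in candidates(token):
        if cand in LEXICON:
            score = p_channel * LEXICON[cand] / total
            scored[cand] = max(score, scored.get(cand, 0.0))
    return sorted(scored.items(), key=lambda kv: kv[1], reverse=True)


def learn(wrong, right, prob):
    """Add a mapping observed during correction (a stand-in for a 'dynamic' mapping)."""
    DEVICE_MAP.setdefault(wrong, []).append((right, prob))
```

In this sketch, `rank("rnodel")` proposes `model` because the `rn` to `m` mapping yields a lexicon word, and calling `learn("ii", "u", 0.1)` makes a token such as `woiild` correctable where it previously produced no candidates.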
Pages: 125-137
Page count: 12
Related papers
50 in total
  • [31] Automated Spelling Correction for Clinical Text Mining in Russian
    Balabaeva, Ksenia
    Funkner, Anastasia
    Kovalchuk, Sergey
    DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 43 - 47
  • [32] Post-OCR text correction for Bulgarian historical documents
    Beshirov, Angel
    Dobreva, Milena
    Dimitrov, Dimitar
    Hardalov, Momchil
    Koychev, Ivan
    Nakov, Preslav
    INTERNATIONAL JOURNAL ON DIGITAL LIBRARIES, 2025, 26 (01)
  • [33] OCR post-correction for detecting adversarial text images
    Imam, Niddal H.
    Vassilakis, Vassilios G.
    Kolovos, Dimitris
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 66
  • [34] Text Detection and Post-OCR Correction in Engineering Documents
    Francois, Mathieu
    Eglin, Veronique
    Biou, Maxime
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 726 - 740
  • [35] Post-OCR text correction for Bulgarian historical documents
    Angel Beshirov
    Milena Dobreva
    Dimitar Dimitrov
    Momchil Hardalov
    Ivan Koychev
    Preslav Nakov
    International Journal on Digital Libraries, 2025, 26 (1)
  • [36] TEXT REVISION - DETECTION AND CORRECTION OF ERRORS
    HACKER, DJ
    PLUMB, C
    BUTTERFIELD, EC
    QUATHAMER, D
    HEINEKEN, E
    JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1994, 86 (01) : 65 - 78
  • [37] A LOGICAL FRAMEWORK FOR THE CORRECTION OF SPELLING-ERRORS IN ELECTRONIC DOCUMENTS
    BERGHEL, HL
    INFORMATION PROCESSING & MANAGEMENT, 1987, 23 (05) : 477 - 494
  • [38] Detection and Correction of Non Word Spelling Errors in Hindi Language
    Jain, Amita
    Jain, Minni
    2014 INTERNATIONAL CONFERENCE ON DATA MINING AND INTELLIGENT COMPUTING (ICDMIC), 2014,
  • [39] Spelling Errors in text copying and in dictation by children with ADHD symptoms and controls
    Dovigo, Valentina
    Re, Anna M.
    PSICOLOGIA CLINICA DELLO SVILUPPO, 2013, 17 (03) : 511 - 520
  • [40] From spelling correction to text cleaning - Using context information
    Schierle, Martin
    Schulz, Sascha
    Ackermann, Markus
    DATA ANALYSIS, MACHINE LEARNING AND APPLICATIONS, 2008, : 397 - +