An improved error model for noisy channel spelling correction

被引:200
作者
Brill, E [1 ]
Moore, RC [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
来源
38TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE | 2000年
关键词
D O I
10.3115/1075218.1075255
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The noisy channel model has been applied to a wide range of problems, including spelling correction. These models consist of two components: a source model and a channel model. Very little research has gone into improving the channel model for spelling correction. This paper describes a new channel model for spelling correction, based on generic string to string edits. Using this model gives significant performance improvements compared to previously proposed models.
引用
收藏
页码:286 / 293
页数:8
相关论文
共 14 条
  • [1] Church K. W., 1991, Stat. Comput, V1, P93, DOI DOI 10.1007/BF01889984
  • [2] Damerau Frederick, 1964, COMMUN ACM, V7, P659
  • [3] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [4] A Winnow-based approach to context-sensitive spelling correction
    Golding, AR
    Roth, D
    [J]. MACHINE LEARNING, 1999, 34 (1-3) : 107 - 130
  • [5] APPROXIMATE STRING MATCHING
    HALL, PAV
    DOWLING, GR
    [J]. COMPUTING SURVEYS, 1980, 12 (04) : 381 - 402
  • [6] Jurafsky D., 2000, Speech and Language Processing. An Introduction to Natural language Processing, Computational Linguistics
  • [7] KUKICH K, 1992, COMPUT SURV, V24, P377
  • [8] Levenshtein V.I., 1966, SOV PHYS DOKL, V10, DOI DOI 10.1109/TVCG.2012.323
  • [9] CONTEXT BASED SPELLING CORRECTION
    MAYS, E
    DAMERAU, FJ
    MERCER, RL
    [J]. INFORMATION PROCESSING & MANAGEMENT, 1991, 27 (05) : 517 - 522
  • [10] OFLAZER K, 1994, SPELLING CORRECTION