A semantics-enhanced language model for unsupervised word sense disambiguation

被引:0
作者
Lin, Shou-De [1 ]
Verspoor, Karin [2 ]
机构
[1] Natl Taiwan Univ, Taipei, Taiwan
[2] Los Alamos Natl Lab, Los Alamos, NM 87544 USA
来源
COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING | 2008年 / 4919卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An N-gram language model aims at capturing statistical word order dependency information from corpora. Although the concept of language models has been applied extensively to handle a variety of NLP problems with reasonable success, the standard model does not incorporate semantic information, and consequently limits its applicability to semantic problems such as word sense disambiguation. We propose a framework that integrates semantic information into the language model schema, allowing a system to exploit both syntactic and semantic information to address NLP problems. Furthermore, acknowledging the limited availability of semantically annotated data, we discuss how the proposed model can be learned without annotated training examples. Finally, we report on a case study showing how the semantics-enhanced language model can be applied to unsupervised word sense disambiguation with promising results.
引用
收藏
页码:287 / +
页数:3
相关论文
共 18 条
  • [1] Banerjee S., 2003, P 18 INT JOINT C ART, V3, P805
  • [2] BAUM LE, 1972, INEQUALITIES, V627, P1
  • [3] Exploiting latent semantic information in statistical language modeling
    Bellegarda, JR
    [J]. PROCEEDINGS OF THE IEEE, 2000, 88 (08) : 1279 - 1296
  • [4] Brody S, 2006, COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, P97
  • [5] Brown P. F., 1992, Computational Linguistics, V18, P467
  • [6] Chien, 2006, COMPUT LINGUIST, V11, P37
  • [7] CUTTING D, 1992, THIRD CONFERENCE ON APPLIED NATURAL LANGUAGE PROCESSING, P133
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] GALLEY M, 2003, P 18 INT JOINT C ART, P1486
  • [10] GRIFFITHS T, 2004, P ADV NEUR INF PROC