Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques

Cited by: 0
Authors
Florentina Hristea
Marius Popescu
Monica Dumitrescu
Affiliation
[1] University of Bucharest
Source
Artificial Intelligence Review | 2008 / Vol. 30
Keywords
Word sense disambiguation; Unsupervised disambiguation; Knowledge-based disambiguation; Bayesian classification; The EM algorithm; WordNet
DOI
Not available
Abstract
This paper gives a full presentation of a new word sense disambiguation method that was introduced in Hristea and Popescu (Fundam Inform 91(3–4):547–562, 2009) and has so far been tested on adjectives (Hristea and Popescu in Fundam Inform 91(3–4):547–562, 2009) and verbs (Hristea in Int Rev Comput Softw 4(1):58–67, 2009). We hereby extend the method to nouns and draw conclusions regarding its performance across all three parts of speech. The method lies at the border between unsupervised and knowledge-based techniques: it performs unsupervised word sense disambiguation based on an underlying Naïve Bayes model, while using WordNet as the knowledge source for feature selection. Its performance is compared with that of previous approaches that rely on completely different feature sets. Test results for all parts of speech involved show that feature selection using a WordNet-type knowledge source is more effective for disambiguation than local-type features (such as part-of-speech tags).
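The general idea in the abstract (an unsupervised Naïve Bayes model whose parameters are estimated with the EM algorithm, over a restricted feature set) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `em_naive_bayes`, the toy contexts, the smoothing constant `alpha`, and the round-robin initialization are all assumptions made for illustration, and the hand-picked `feature_vocab` merely stands in for a vocabulary that the paper derives from WordNet.

```python
import math


def em_naive_bayes(contexts, feature_vocab, n_senses, n_iter=30, alpha=0.1):
    """Cluster contexts of one target word into senses (hypothetical helper).

    Fits a Naive Bayes mixture with EM and returns one sense index per context.
    """
    vocab = sorted(feature_vocab)
    # Feature selection: keep only words from the chosen vocabulary
    # (assumed here; the paper selects features using WordNet).
    docs = [[w for w in ctx if w in feature_vocab] for ctx in contexts]
    # Deterministic start: context i is fully assigned to sense i mod n_senses.
    resp = [[1.0 if s == i % n_senses else 0.0 for s in range(n_senses)]
            for i in range(len(docs))]
    for _ in range(n_iter):
        # M-step: smoothed sense priors and per-sense word probabilities.
        prior = [(sum(r[s] for r in resp) + alpha) / (len(docs) + alpha * n_senses)
                 for s in range(n_senses)]
        word_prob = []
        for s in range(n_senses):
            counts = {w: alpha for w in vocab}
            for doc, r in zip(docs, resp):
                for w in doc:
                    counts[w] += r[s]
            total = sum(counts.values())
            word_prob.append({w: c / total for w, c in counts.items()})
        # E-step: posterior probability of each sense given each context.
        for i, doc in enumerate(docs):
            log_post = [math.log(prior[s])
                        + sum(math.log(word_prob[s][w]) for w in doc)
                        for s in range(n_senses)]
            top = max(log_post)
            weights = [math.exp(lp - top) for lp in log_post]
            norm = sum(weights)
            resp[i] = [x / norm for x in weights]
    # Hard decision: most probable sense for each context.
    return [max(range(n_senses), key=lambda s: r[s]) for r in resp]


# Toy contexts for the target word "bank" (invented for illustration).
contexts = [
    ["the", "money", "loan", "deposit"],
    ["loan", "deposit", "interest"],
    ["a", "money", "interest"],
    ["the", "river", "water", "shore"],
    ["water", "shore", "fishing"],
    ["a", "river", "fishing"],
]
feature_vocab = {"money", "loan", "deposit", "interest",
                 "river", "water", "shore", "fishing"}
labels = em_naive_bayes(contexts, feature_vocab, n_senses=2)
print(labels)
```

On this toy data the three financial contexts and the three river contexts end up in separate clusters. The sense indices are arbitrary cluster labels; mapping them to dictionary senses is a separate step that this sketch does not attempt.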
References
[1] Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. J Royal Stat Soc B 39:1-38
[2] Gale WA, Church KW, Yarowsky D (1992) A method for disambiguating word senses in a large corpus. Comput Humanit 26:415-439
[3] Gale WA, Church KW, Yarowsky D (1995) Discrimination decisions for 100,000-dimensional space. Ann Oper Res 55:323-344
[4] Hristea F (2009) Recent advances concerning the usage of the Naïve Bayes Model in unsupervised word sense disambiguation. Int Rev Comput Softw 4:58-67
[5] Hristea F, Popescu M (2009) Adjective sense disambiguation at the border between unsupervised and knowledge-based techniques. Fundam Inform 91:547-562
[6] Miller GA (1990) Nouns in WordNet: a lexical inheritance system. Int J Lexicography 3:245-264
[7] Miller GA, Beckwith R, Fellbaum C, Gross D, Miller KJ (1990) WordNet: an on-line lexical database. Int J Lexicography 3:234-244
[8] Miller GA (1995) WordNet: a lexical database. Commun ACM 38:39-41
[9] Miller GA, Hristea F (2006) WordNet nouns: classes and instances. Comput Linguist 32:1-3
[10] Schütze H (1998) Automatic word-sense discrimination. Comput Linguist 24:97-123