State of the art versus classical clustering for unsupervised word sense disambiguation

Cited by: 8
Authors
Popescu, Marius
Hristea, Florentina
Affiliation
[1] C.P. 010014 Bucharest, Academiei 14, Str.
Keywords
Word sense disambiguation; Unsupervised disambiguation; Bayesian classification; The EM algorithm; WordNet; Spectral clustering; CORPUS;
DOI
10.1007/s10462-010-9193-7
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper ultimately discusses the importance of the clustering method used in unsupervised word sense disambiguation. It illustrates that a powerful clustering technique can make up for the lack of external knowledge of all types. It argues that feature selection does not always improve disambiguation results, especially when using an advanced, state-of-the-art method, here exemplified by spectral clustering. Disambiguation results obtained with spectral clustering for the main parts of speech (nouns, adjectives, verbs) are compared to those of the classical clustering method given by the Naïve Bayes model. For unsupervised word sense disambiguation with an underlying Naïve Bayes model, feature selection performed in two completely different ways is surveyed. The type of feature selection providing the best results (WordNet-based feature selection) is also used with spectral clustering. The conclusion is that spectral clustering without feature selection (but using its own feature weighting) produces superior disambiguation results for all parts of speech.
Pages: 241-264
Page count: 24
Related papers
  • [1] State of the art versus classical clustering for unsupervised word sense disambiguation
    Marius Popescu
    Florentina Hristea
    Artificial Intelligence Review, 2011, 35 : 241 - 264
  • [2] Unsupervised word sense disambiguation with N-gram features
    Preotiuc-Pietro, Daniel
    Hristea, Florentina
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 41 (02) : 241 - 260
  • [3] Unsupervised word sense disambiguation with N-gram features
    Daniel Preotiuc-Pietro
    Florentina Hristea
    Artificial Intelligence Review, 2014, 41 : 241 - 260
  • [4] A clustering-based Approach for Unsupervised Word Sense Disambiguation
    Martin-Wanton, Tamara
    Berlanga-Llavori, Rafael
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2012, (49) : 49 - 56
  • [5] Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques
    Hristea, Florentina
    Popescu, Marius
    Dumitrescu, Monica
    ARTIFICIAL INTELLIGENCE REVIEW, 2008, 30 (1-4) : 67 - 86
  • [6] Performing word sense disambiguation at the border between unsupervised and knowledge-based techniques
    Florentina Hristea
    Marius Popescu
    Monica Dumitrescu
    Artificial Intelligence Review, 2008, 30
  • [7] Unsupervised Word Sense Disambiguation Using The WWW
    Klapaftis, Ioannis P.
    Manandhar, Suresh
    STAIRS 2006, 2006, 142 : 174 - 183
  • [8] Unsupervised Word Sense Disambiguation with Multilingual Representations
    Fernandez-Ordonez, Erwin
    Mihalcea, Rada
    Hassan, Samer
    LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012 : 847 - 851
  • [9] Word Sense Disambiguation in Bengali: an Unsupervised Approach
    Pal, Alok Ranjan
    Saha, Diganta
    PROCEEDINGS OF THE 2017 IEEE SECOND INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND COMMUNICATION TECHNOLOGIES (ICECCT), 2017
  • [10] Multilingual versus monolingual word sense disambiguation
    Ion, Radu
    Tufis, Dan
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2009, 12 (2-3) : 113 - 124