Evaluating Word Sense Induction and Disambiguation Methods

Cited by: 7
Authors
Klapaftis, Ioannis P. [1]
Manandhar, Suresh [2]
Affiliations
[1] Microsoft Corp, Redmond, WA 98052 USA
[2] Univ York, Dept Comp Sci, York YO10 5DD, N Yorkshire, England
Funding
Engineering and Physical Sciences Research Council (UK); National Science Foundation (USA)
Keywords
Word Sense Induction; Word Sense Disambiguation; Lexical Semantics;
DOI
10.1007/s10579-012-9205-0
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Word Sense Induction (WSI) is the task of identifying the different uses (senses) of a target word in a given text in an unsupervised manner, i.e., without relying on any external resources such as dictionaries or sense-tagged data. This paper presents a thorough description of the SemEval-2010 WSI task and a new evaluation setting for sense induction methods. Our contributions are twofold. First, we provide a detailed analysis of the SemEval-2010 WSI task evaluation results and identify the shortcomings of current evaluation measures. Second, we present a new evaluation setting that assesses participating systems' performance according to the skewness of the target words' sense distributions, showing that some methods perform well above the Most Frequent Sense (MFS) baseline on highly skewed distributions.
Pages: 579-605
Page count: 27