An Improved Semantic Similarity Measure for Word Pairs

Cited by: 6
Authors
Cai, Songmei [1]
Lu, Zhao [1]
Affiliations
[1] East China Normal University, Department of Computer Science and Technology, Shanghai 200062, People's Republic of China
Source
2010 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING: IC4E 2010, PROCEEDINGS | 2010
Keywords
Semantic Similarity; WordNet; DAG theory
DOI
10.1109/IC4E.2010.20
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Measuring the semantic similarity between word pairs is a fundamental operation in natural language processing tasks such as information retrieval and word sense disambiguation. Nevertheless, developing a computational method whose results come close to human judgments remains difficult, partly because similarity is subjective. In this paper, we propose an improved semantic similarity measure between words. It treats the structure of WordNet 3.0 as a directed acyclic graph (DAG) and combines an improved distance-based measure with an information-based measure. On the Miller and Charles dataset of 30 noun pairs, the correlation between the similarity values produced by the proposed measure and the human ratings is higher than that of some other measures reported for the same dataset.
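The general idea of combining a distance-based score with an information-based score over a DAG-shaped taxonomy, and then checking agreement with human ratings, can be sketched as follows. This is only an illustration under stated assumptions, not the measure proposed in the paper: the tiny taxonomy `PARENTS`, the structure-derived information content in `ic`, the inverse-path-length score, the Lin-style ratio, and the mixing weight `alpha` are all hypothetical choices made for the example.

```python
import math
from collections import deque

# Hypothetical toy is-a taxonomy: node -> list of parents.
# "toy_car" has two parents, so the structure is a DAG, not a tree.
PARENTS = {
    "entity": [],
    "vehicle": ["entity"],
    "toy": ["entity"],
    "car": ["vehicle"],
    "bicycle": ["vehicle"],
    "toy_car": ["car", "toy"],
}

def ancestors(node):
    """Map each ancestor (including node itself) to its minimum edge distance."""
    dist = {node: 0}
    queue = deque([node])
    while queue:
        cur = queue.popleft()
        for parent in PARENTS[cur]:
            if parent not in dist:
                dist[parent] = dist[cur] + 1
                queue.append(parent)
    return dist

def path_sim(a, b):
    """Distance-based score: 1 / (1 + shortest path through a common ancestor)."""
    da, db = ancestors(a), ancestors(b)
    shortest = min(da[c] + db[c] for c in da if c in db)
    return 1.0 / (1.0 + shortest)

# Assumption: information content is derived from the taxonomy itself,
# using the share of nodes each concept subsumes (no corpus counts).
SUBSUMED = {n: 0 for n in PARENTS}
for n in PARENTS:
    for anc in ancestors(n):
        SUBSUMED[anc] += 1

def ic(node):
    return -math.log(SUBSUMED[node] / len(PARENTS))

def lin_sim(a, b):
    """Information-based score in the style of Lin's ratio."""
    if a == b:
        return 1.0
    da, db = ancestors(a), ancestors(b)
    best = max(ic(c) for c in da if c in db)
    denom = ic(a) + ic(b)
    return 2.0 * best / denom if denom > 0 else 0.0

def combined(a, b, alpha=0.5):
    """Weighted mix of both scores; alpha is a hypothetical parameter."""
    return alpha * path_sim(a, b) + (1.0 - alpha) * lin_sim(a, b)

def pearson(xs, ys):
    """Pearson correlation, e.g. between computed scores and human ratings."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)
```

To evaluate such a measure, one would compute `combined` for each word pair in a rated dataset and pass the resulting scores, together with the human ratings, to `pearson`.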
Pages: 212-216
Page count: 5