Concept vector for semantic similarity and relatedness based on WordNet structure

被引:41
作者
Liu, Hongzhe [1 ,2 ,3 ]
Bao, Hong [1 ,2 ,3 ]
Xu, De [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
[2] Beijing Jiaotong Univ, Inst Comp Sci & Engn, Beijing, Peoples R China
[3] Beijing Union Univ, Beijing Key Lab Informat Serv Engn, Beijing, Peoples R China
关键词
Concept similarity; Concept relatedness; Concept vector model; Hierarchical concept tree; Hierarchical concept graph; WordNet; CONTEXT;
D O I
10.1016/j.jss.2011.08.029
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We define WordNet based hierarchy concept tree (HCT) and hierarchy concept graph (HCG), HCT contains hyponym/hypernym kind of relation in WordNet while HCG has more meronym/holonym kind of edges than in HCT, and present an advanced concept vector model for generalizing standard representations of concept similarity in terms of WordNet-based HCT. In this model, each concept node in the hierarchical tree has ancestor and descendent concept nodes composing its relevancy nodes, thus a concept node is represented as a concept vector according to its relevancy nodes' local density and the similarity of the two concepts is obtained by computing the cosine similarity of their vectors. In addition, the model is adjustable in terms of multiple descendent concept nodes. This paper also provides a method by which this concept vector may be applied with regard to HCG into HCT. With this model, semantic similarity and relatedness are computed based on HCT and HCG. The model contains structural information inherent to and hidden in the HCT and HCG. Our experiments showed that this model compares favorably to others and is flexible in that it can make comparisons between any two concepts in a WordNet-like structure without relying on any additional dictionary or corpus information. (C) 2011 Elsevier Inc. All rights reserved.
引用
收藏
页码:370 / 381
页数:12
相关论文
共 25 条
[1]  
Alexander B., 2006, COMPUTATIONAL LINGUI, V32
[2]   A graph modeling of semantic similarity between words [J].
Alvarez, Marco A. ;
Lim, SeungJin .
ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, :355-+
[3]  
[Anonymous], S PATWARDHAN T PEDER
[4]  
Banerjee S., 2003, P 18 INT JOINT C ART, P805
[5]   An Improved Semantic Similarity Measure for Word Pairs [J].
Cai, Songmei ;
Lu, Zhao .
2010 INTERNATIONAL CONFERENCE ON E-EDUCATION, E-BUSINESS, E-MANAGEMENT AND E-LEARNING: IC4E 2010, PROCEEDINGS, 2010, :212-216
[6]  
Hirst G, 1998, LANG SPEECH & COMMUN, P305
[7]  
Jiang J., 1997, P COLING TAIW
[8]  
Kim J, 2006, P 15 ACM INT C INF K, P483
[9]  
Leacock C, 1998, LANG SPEECH & COMMUN, P265
[10]  
Lei L., 2009, INT C ART INT COMP I, V3, P72