Measuring similarity and relatedness using multiple semantic relations in WordNet

被引:0
作者
Xinhua Zhu
Xuechen Yang
Yanyi Huang
Qingsong Guo
Bo Zhang
机构
[1] Guangxi Normal University,Guangxi Key Lab of Multi
[2] Hezhou University,source Information Mining and Security
来源
Knowledge and Information Systems | 2020年 / 62卷
关键词
Semantic similarity; Semantic relatedness; Nearest common descendant; Multiple semantic complement; WordNet;
D O I
暂无
中图分类号
学科分类号
摘要
Semantic similarity and relatedness computation has attracted an increasing amount of attention among researchers. The majority of previous studies, including edge-based and information content-based methods, rely on a single semantic relationship in WordNet such as the “is-a” relation. However, a performance ceiling may have been created by semantic unicity and inadequate calculation in solely “is-a” relation-based measurements, i.e., the computed results for some word pairs are too small and significantly deviate from human judgments. For this problem, we propose the following solutions: (1) We introduce the notion of the nearest common descendant to provide a supplement for commonalities between concepts according to genetics theory. (2) We design various targeted methods for different incomplete semantic relations. Therefore, various semantic relations can participate in similarity and relatedness computations in their most appropriate manners. (3) We utilize the cross-use of incomplete semantic relations similar-to and antonymy to solve the challenge of adjective and adverb similarity/relatedness measurements in WordNet. (4) We propose a targeted independent computation and largest contribution aggregation method to break through the performance ceiling of similarity/relatedness measurements based on single “is-a” relations. We conduct evaluations of our proposed model using seven extensively employed datasets. These evaluations indicate that our method significantly improves the performance of the existing methods based on single “is-a” relations. Their best Pearson coefficient with human judgments on both the MC30 and RG65 is increased to 0.9. With the development and enrichment of semantic relations in WordNet, our proposed model can be expected to have a more prominent role.
引用
收藏
页码:1539 / 1569
页数:30
相关论文
共 100 条
  • [1] Zhu GG(2018)Exploiting semantic similarity for named entity disambiguation in knowledge graphs Expert Syst Appl 101 8-24
  • [2] Iglesias CA(2018)Using semantic similarity to reduce wrong labels in distant supervision for relation extraction Inf Process Manag 54 593-608
  • [3] Ru C(2015)Using knowledge based relatedness for information retrieval Knowl Inf Syst 44 689-718
  • [4] Tang J(1977)Features of similarity Psychol Rev 84 327-352
  • [5] Li S(1991)Contextual correlates of semantic similarity Lang Cogn Process 6 1-28
  • [6] Xie S(2018)An efficient path computing model for measuring semantic similarity using edge and density Knowl Inf Syst 55 79-111
  • [7] Wang T(2015)A WordNet-based semantic similarity measurement combining edge-counting and information content theory Eng Appl Artif Intel 39 80-88
  • [8] Otegi A(2014)A new semantic relatedness measurement using WordNet features Knowl Inf Syst 41 467-497
  • [9] Arregi X(2003)An approach for measuring semantic similarity between words using multiple information sources IEEE Trans Knowl Data Eng 15 871-882
  • [10] Ansa O(2012)A new model of information content based on concept’s topology for measuring semantic similarity in WordNet Int J Grid Distrib Comput 5 81-94