An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links

Cited by: 0
Authors
Xinhua Zhu
Qingsong Guo
Bo Zhang
Fei Li
Affiliations
[1] Guangxi Normal University, Guangxi Key Lab of Multi-source Information Mining & Security
[2] Hezhou University, School of Mathematics & Computer Science
[3] Beijing Institute of Technology
Source
Applied Intelligence | 2019 / Vol. 49
Keywords
Semantic relatedness; Link vector; Vector similarity metric; Disambiguation; Wikipedia
DOI
Not available
Abstract
Measuring the semantic relatedness between concepts is a fundamental research topic in natural language processing. Among Wikipedia-based measures, the link-based model is the most promising because the manually defined links in Wikipedia are refined and close to human semantics. This paper proposes a Wikipedia two-way link model that extends the existing Wikipedia one-way out-link model; the proposed model is low-dimensional, efficient, and easy to implement and reproduce. First, the model combines the out-links and in-links of a concept in Wikipedia into a bidirectional link vector that serves as the concept's semantic interpreter, and uses a TF*IDF-based bidirectional weighting method to calculate, in a uniform way, the strength of the mutual association between a given concept and each of its out-link or in-link concepts. Second, we propose a disambiguation strategy based on the social awareness of senses: the out-links within a disambiguation page are ranked directly in the order in which they occur on that page, and an adjustable threshold determines how many senses are selected. Moreover, we propose new vector similarity metrics based on the logarithm and the exponent to improve the overall performance of semantic relatedness measurement based on Wikipedia links. Experimental results on several well-recognized datasets demonstrate that our model surpasses the popular Naïve Explicit Semantic Analysis (Naïve-ESA) and Wikipedia Out-Link vector-based Measure (WOLM) methods on current Wikipedia versions, and that the bidirectional link model significantly improves the performance of the existing one-way link model in practical applications.
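The idea of a bidirectional link vector can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact TF*IDF weighting or the logarithm- and exponent-based similarity metrics, so the sketch assumes a plain IDF-style weight and ordinary cosine similarity instead. The toy link graph `OUT_LINKS` and the function names `invert`, `bidirectional_vector`, and `relatedness` are hypothetical.

```python
import math
from collections import defaultdict

# Hypothetical toy link graph: page -> set of pages it links to.
OUT_LINKS = {
    "Cat": {"Mammal", "Pet"},
    "Dog": {"Mammal", "Pet"},
    "Mammal": {"Animal"},
    "Pet": {"Animal"},
    "Car": {"Vehicle"},
    "Animal": set(),
    "Vehicle": set(),
}


def invert(out_links):
    """Build the in-link index (page -> set of pages linking to it)."""
    in_links = defaultdict(set)
    for src, targets in out_links.items():
        for dst in targets:
            in_links[dst].add(src)
    return in_links


def bidirectional_vector(concept, out_links, in_links):
    """Sparse vector over the out-links and in-links of `concept`.

    Assumed weighting (not taken from the paper): a neighbour that is
    connected to many pages is less informative, so it gets a smaller
    IDF-style weight log(N / degree).
    """
    n = len(out_links)
    vec = {}
    # Out-link target t: weight by how many pages link to t.
    for t in out_links.get(concept, set()):
        vec[("out", t)] = math.log(n / len(in_links[t]))
    # In-link source s: weight by how many pages s links to.
    for s in in_links.get(concept, set()):
        vec[("in", s)] = math.log(n / len(out_links[s]))
    return vec


def relatedness(a, b, out_links, in_links):
    """Cosine similarity of the two bidirectional link vectors."""
    va = bidirectional_vector(a, out_links, in_links)
    vb = bidirectional_vector(b, out_links, in_links)
    dot = sum(w * vb[k] for k, w in va.items() if k in vb)
    norm_a = math.sqrt(sum(w * w for w in va.values()))
    norm_b = math.sqrt(sum(w * w for w in vb.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


IN_LINKS = invert(OUT_LINKS)
cat_dog = relatedness("Cat", "Dog", OUT_LINKS, IN_LINKS)
cat_car = relatedness("Cat", "Car", OUT_LINKS, IN_LINKS)
```

In this toy graph, "Cat" and "Dog" share all of their out-links, so their vectors coincide, while "Cat" and "Car" share no links and score zero. Keeping out-link and in-link dimensions under separate keys is a design choice of the sketch; the paper may combine the two directions differently.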
Pages: 3708-3730
Page count: 22