An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links

被引:0
作者
Xinhua Zhu
Qingsong Guo
Bo Zhang
Fei Li
机构
[1] Guangxi Normal University,Guangxi Key Lab of Multi
[2] Hezhou University,source Information Mining & Security
[3] Beijing Institute of Technology,School of Mathematics & Computer Science
来源
Applied Intelligence | 2019年 / 49卷
关键词
Semantic relatedness; Link vector; Vector similarity metric; Disambiguation; Wikipedia;
D O I
暂无
中图分类号
学科分类号
摘要
The measurement of the semantic relatedness between concepts is an important fundamental research topic in natural language processing. The link-based model is the most promising relatedness method in Wikipedia-based measures because its manually defined links in Wikipedia are refined and close to the semantics of humans. This paper proposes a Wikipedia two-way link model to extend the existing Wikipedia one-way out-link model, which has a low dimension and a high efficiency, as well as being easy to implement and repeat. First, this model utilizes the out-links and in-links of concepts in Wikipedia to combine into a bidirectional link vector for concept semantic interpreter and uses a TF*IDF-based bidirectional weight method to uniformly calculate the strength of the mutual association between a given concept and its out-link or in-link concept. Second, we propose a disambiguation strategy based on the social awareness of senses that directly sorts the out-links within a disambiguation page in the order in which they occur in the disambiguation page and adopts an adjustable threshold to determine how many senses will be selected. Moreover, we also propose new vector similarity metrics based on logarithm and exponent to improve the comprehensive performance of the semantic relatedness measurements based on Wikipedia links. The experimental results on some well-recognized datasets demonstrate that our model surpasses the existing popular Naïve Explicit Semantic Analysis (Naïve-ESA) and Wikipedia Out-Link vector-based Measure (WOLM) methods in the current Wikipedia versions and that our bidirectional link model significantly improves the performance of the existing one-way link model in practical applications.
引用
收藏
页码:3708 / 3730
页数:22
相关论文
共 50 条
  • [31] Measuring similarity and relatedness using multiple semantic relations in WordNet
    Xinhua Zhu
    Xuechen Yang
    Yanyi Huang
    Qingsong Guo
    Bo Zhang
    Knowledge and Information Systems, 2020, 62 : 1539 - 1569
  • [32] Computing text semantic relatedness using the contents and links of a hypertext encyclopedia
    Yazdani, Majid
    Popescu-Belis, Andrei
    ARTIFICIAL INTELLIGENCE, 2013, 194 : 176 - 202
  • [33] Exploiting Level-Wise Category Links for Semantic Relatedness Computing
    Zheng, Hai-Tao
    Wu, Wenzhen
    Jiang, Yong
    Xia, Shu-Tao
    NEURAL INFORMATION PROCESSING (ICONIP 2014), PT II, 2014, 8835 : 556 - 564
  • [34] Evaluating semantic similarity and relatedness between concepts by combining taxonomic and non-taxonomic semantic features of WordNet and Wikipedia
    Hussain, Muhammad Jawad
    Bai, Heming
    Wasti, Shahbaz Hassan
    Huang, Guangjian
    Jiang, Yuncheng
    INFORMATION SCIENCES, 2023, 625 : 673 - 699
  • [35] Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness
    Ben Aouicha, Mohamed
    Taieb, Mohamed Ali Hadj
    Ben Hamadou, Abdelmajid
    APPLIED INTELLIGENCE, 2016, 45 (02) : 475 - 511
  • [36] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    Qureshi, M. Atif
    Younus, Arjumand
    O'Riordan, Colm
    Pasi, Gabriella
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2018, 9 (05) : 1403 - 1413
  • [37] A wikipedia-based semantic relatedness framework for effective dimensions classification in online reputation management
    M. Atif Qureshi
    Arjumand Younus
    Colm O’Riordan
    Gabriella Pasi
    Journal of Ambient Intelligence and Humanized Computing, 2018, 9 : 1403 - 1413
  • [38] Taxonomy-based information content and wordnet-wiktionary-wikipedia glosses for semantic relatedness
    Mohamed Ben Aouicha
    Mohamed Ali Hadj Taieb
    Abdelmajid Ben Hamadou
    Applied Intelligence, 2016, 45 : 475 - 511
  • [39] Measuring Semantic Relatedness with Knowledge Association Network
    Li, Jiapeng
    Chen, Wei
    Gu, Binbin
    Fang, Junhua
    Li, Zhixu
    Zhao, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 676 - 691
  • [40] A graph-based semantic relatedness assessment method combining wikipedia features
    Li, Pu
    Xiao, Bao
    Ma, Wenjun
    Jiang, Yuncheng
    Zhang, Zhifeng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 268 - 281