An efficient approach for measuring semantic relatedness using Wikipedia bidirectional links

Authors
Xinhua Zhu
Qingsong Guo
Bo Zhang
Fei Li
Affiliations
[1] Guangxi Normal University, Guangxi Key Lab of Multi-source Information Mining & Security
[2] Hezhou University
[3] Beijing Institute of Technology, School of Mathematics & Computer Science
Source
Applied Intelligence | 2019 / Vol. 49
Keywords
Semantic relatedness; Link vector; Vector similarity metric; Disambiguation; Wikipedia
Abstract
Measuring the semantic relatedness between concepts is a fundamental research topic in natural language processing. Among Wikipedia-based measures, the link-based model is the most promising, because the manually defined links in Wikipedia are refined and close to human semantics. This paper proposes a Wikipedia two-way link model that extends the existing Wikipedia one-way out-link model, which has a low dimension and high efficiency and is easy to implement and reproduce. First, the model combines the out-links and in-links of a concept in Wikipedia into a bidirectional link vector that serves as the concept's semantic interpreter, and uses a TF*IDF-based bidirectional weighting method to uniformly calculate the strength of the mutual association between a given concept and each of its out-link or in-link concepts. Second, we propose a disambiguation strategy based on the social awareness of senses that directly sorts the out-links within a disambiguation page in the order in which they occur on that page and adopts an adjustable threshold to determine how many senses are selected. Moreover, we propose new vector similarity metrics based on logarithms and exponents to improve the overall performance of semantic relatedness measurement based on Wikipedia links. Experimental results on several well-recognized datasets demonstrate that our model surpasses the popular Naïve Explicit Semantic Analysis (Naïve-ESA) and Wikipedia Out-Link vector-based Measure (WOLM) methods on current Wikipedia versions, and that our bidirectional link model significantly improves the performance of the existing one-way link model in practical applications.
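The bidirectional link vector idea can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy link graph, the function names, and the particular TF*IDF-style weight are hypothetical, and plain cosine similarity stands in for the paper's logarithm- and exponent-based metrics, which the abstract does not specify.

```python
import math
from collections import Counter

# Hypothetical toy link graph: article -> articles it links to (out-links).
OUT_LINKS = {
    "Cat": ["Mammal", "Pet", "Dog"],
    "Dog": ["Mammal", "Pet", "Wolf"],
    "Wolf": ["Mammal", "Dog"],
    "Mammal": ["Cat", "Dog", "Wolf"],
    "Pet": ["Cat", "Dog"],
    "Car": ["Engine"],
    "Engine": ["Car"],
}


def invert(graph):
    """Build the in-link map (article -> articles that link to it)."""
    inverse = {node: [] for node in graph}
    for source, targets in graph.items():
        for target in targets:
            inverse.setdefault(target, []).append(source)
    return inverse


IN_LINKS = invert(OUT_LINKS)


def idf(neighbor):
    """Assumed IDF-style factor: widely connected articles weigh less."""
    connected = set(OUT_LINKS.get(neighbor, [])) | set(IN_LINKS.get(neighbor, []))
    if not connected:
        return 0.0
    return math.log(1 + len(OUT_LINKS) / len(connected))


def link_vector(concept):
    """Combine out-links and in-links into one weighted sparse vector."""
    # A neighbor linked in both directions gets a count (TF) of 2.
    counts = Counter(OUT_LINKS.get(concept, [])) + Counter(IN_LINKS.get(concept, []))
    return {neighbor: tf * idf(neighbor) for neighbor, tf in counts.items()}


def cosine(u, v):
    """Plain cosine similarity between two sparse vectors (dicts)."""
    dot = sum(weight * v[key] for key, weight in u.items() if key in v)
    norm_u = math.sqrt(sum(weight * weight for weight in u.values()))
    norm_v = math.sqrt(sum(weight * weight for weight in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


cat_dog = cosine(link_vector("Cat"), link_vector("Dog"))
cat_car = cosine(link_vector("Cat"), link_vector("Car"))
print(f"Cat~Dog: {cat_dog:.3f}  Cat~Car: {cat_car:.3f}")
```

In this toy graph, "Cat" and "Dog" share the neighbors "Mammal" and "Pet" and so score above zero, while "Cat" and "Car" share no neighbors and score zero. A one-way model would build the vector from `OUT_LINKS` alone.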
Pages: 3708-3730
Number of pages: 22