PageRank on Wikipedia: Towards General Importance Scores for Entities

被引:21
|
作者
Thalhammer, Andreas [1 ]
Rettinger, Achim [1 ]
机构
[1] Karlsruhe Inst Technol, AIFB, Karlsruhe, Germany
来源
SEMANTIC WEB, ESWC 2016 | 2016年 / 9989卷
关键词
Wikipedia; DBpedia; PageRank; Link analysis; Page views; Rank correlation;
D O I
10.1007/978-3-319-47602-5_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Link analysis methods are used to estimate importance in graph-structured data. In that realm, the PageRank algorithm has been used to analyze directed graphs, in particular the link structure of the Web. Recent developments in information retrieval focus on entities and their relations (i.e., knowledge graph panels). Many entities are documented in the popular knowledge base Wikipedia. The cross-references within Wikipedia exhibit a directed graph structure that is suitable for computing PageRank scores as importance indicators for entities. In this work, we present different PageRank-based analyses on the link graph of Wikipedia and according experiments. We focus on the question whether some links-based on their context/position in the article text-can be deemed more important than others. In our variants, we change the probabilistic impact of links in accordance to their context/position on the page and measure the effects on the output of the PageRank algorithm. We compare the resulting rankings and those of existing systems with page-view-based rankings and provide statistics on the pairwise computed Spearman and Kendall rank correlations.
引用
收藏
页码:227 / 240
页数:14
相关论文
共 50 条
  • [41] WIKITAG: WIKIPEDIA-BASED KNOWLEDGE EMBEDDINGS TOWARDS IMPROVED ACOUSTIC EVENT CLASSIFICATION
    Zhang, Qin
    Tang, Qingming
    Kao, Chieh-Chi
    Sun, Ming
    Liu, Yang
    Wang, Chao
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 136 - 140
  • [42] WIKIPEDIA The Role, Importance, Content Creation Methods and Comparison of English, Bosnian, Serbian and Croatian Language Versions
    Zaric, Biljana
    BOSNIACA-JOURNAL OF THE NATIONAL AND UNIVERSITY LIBRARY OF BOSNIA AND HERZEGOVINA, 2009, (14): : 30 - 39
  • [43] Importance Analysis of Causative Nodes for Accident Chains of Railway Locomotive Operation Based on STPA-PageRank Method
    Wan, Ping
    Yang, Wei-Lun
    Luo, Jie-Wen
    Ma, Xiao-Feng
    PROMET-TRAFFIC & TRANSPORTATION, 2025, 37 (01): : 137 - 150
  • [44] Towards Improving Wikipedia as an Image-Rich Encyclopaedia through Analyzing Appropriateness of Images for an Article
    Zhang, Xinpeng
    Asano, Yasuhito
    Yoshikawa, Masatoshi
    WEB TECHNOLOGIES AND APPLICATIONS, 2011, 6612 : 200 - 212
  • [45] Robust circuitry-based scores of structural importance of human brain areas
    Hegedus, Daniel
    Grolmusz, Vince
    PLOS ONE, 2024, 19 (01):
  • [46] A Novel Approach to Rank Text-based Essays using Pagerank Method Towards Student's Motivational Element
    Arifin, M. Zainal
    Pee, Naim Che
    Herman, Nanna Suryana
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (09) : 151 - 158
  • [47] Eigenvectors of directed graphs and importance scores: dominance, T-Rank, and sink remedies
    Bjelland, J.
    Burgess, M.
    Canright, G.
    Engo-Monsen, K.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2010, 20 (01) : 98 - 151
  • [48] Eigenvectors of directed graphs and importance scores: dominance, T-Rank, and sink remedies
    J. Bjelland
    M. Burgess
    G. Canright
    K. Engø-Monsen
    Data Mining and Knowledge Discovery, 2010, 20 : 98 - 151
  • [49] 'WP2Cochrane', a tool linking Wikipedia to the Cochrane Library: Results of a bibliometric analysis evaluating article quality and importance
    Joorabchi, Arash
    Doherty, Cailbhe
    Dawson, Jennifer
    HEALTH INFORMATICS JOURNAL, 2020, 26 (03) : 1881 - 1897