PageRank on Wikipedia: Towards General Importance Scores for Entities

Cited by: 21
Authors
Thalhammer, Andreas [1]
Rettinger, Achim [1]
Affiliations
[1] Karlsruhe Inst Technol, AIFB, Karlsruhe, Germany
Source
SEMANTIC WEB, ESWC 2016 | 2016 / Volume 9989
Keywords
Wikipedia; DBpedia; PageRank; Link analysis; Page views; Rank correlation;
DOI
10.1007/978-3-319-47602-5_41
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Link analysis methods are used to estimate importance in graph-structured data. In that realm, the PageRank algorithm has been used to analyze directed graphs, in particular the link structure of the Web. Recent developments in information retrieval focus on entities and their relations (i.e., knowledge graph panels). Many entities are documented in the popular knowledge base Wikipedia. The cross-references within Wikipedia exhibit a directed graph structure that is suitable for computing PageRank scores as importance indicators for entities. In this work, we present different PageRank-based analyses on the link graph of Wikipedia and corresponding experiments. We focus on the question of whether some links, based on their context/position in the article text, can be deemed more important than others. In our variants, we change the probabilistic impact of links in accordance with their context/position on the page and measure the effects on the output of the PageRank algorithm. We compare the resulting rankings and those of existing systems with page-view-based rankings and provide statistics on the pairwise computed Spearman and Kendall rank correlations.
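The pipeline summarized in the abstract (PageRank over a link graph in which some links carry more probabilistic weight, followed by rank correlation against page views) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The toy graph, the link weights (2.0 for a link assumed to appear early in the article text, 1.0 otherwise), the page-view counts, and the function names `pagerank`, `spearman`, and `kendall` are all invented for illustration, and the correlation helpers assume there are no tied scores.

```python
from itertools import combinations


def pagerank(links, damping=0.85, iterations=100, tol=1e-10):
    """Power-iteration PageRank on a weighted directed graph.

    links maps source -> {target: weight}. Weights must be positive; a
    larger weight makes the random surfer more likely to follow that link.
    """
    nodes = set(links)
    for targets in links.values():
        nodes.update(targets)
    nodes = sorted(nodes)
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        # Rank held by pages without outgoing links is spread uniformly.
        dangling = sum(rank[v] for v in nodes if not links.get(v))
        new = {v: (1.0 - damping) / n + damping * dangling / n for v in nodes}
        for src, targets in links.items():
            total = sum(targets.values())
            for dst, weight in targets.items():
                new[dst] += damping * rank[src] * weight / total
        delta = sum(abs(new[v] - rank[v]) for v in nodes)
        rank = new
        if delta < tol:
            break
    return rank


def to_ranks(scores):
    """Rank positions (1 = highest score); assumes no tied scores."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {item: pos for pos, item in enumerate(ordered, start=1)}


def spearman(a, b):
    """Spearman's rho for two score dicts over the same keys, no ties."""
    ra, rb = to_ranks(a), to_ranks(b)
    n = len(ra)
    d2 = sum((ra[k] - rb[k]) ** 2 for k in ra)
    return 1.0 - 6.0 * d2 / (n * (n * n - 1))


def kendall(a, b):
    """Kendall's tau-a: (concordant - discordant) / number of pairs."""
    score = 0
    for x, y in combinations(a, 2):
        s = (a[x] - a[y]) * (b[x] - b[y])
        score += (s > 0) - (s < 0)
    n = len(a)
    return score / (n * (n - 1) / 2)


# Toy link graph (invented). Weight 2.0 marks a link assumed to appear
# early in the article text, weight 1.0 a link further down.
links = {
    "A": {"B": 2.0, "C": 1.0},
    "B": {"C": 1.0},
    "C": {"A": 1.0},
    "D": {"C": 2.0, "A": 1.0},
}
page_views = {"A": 1200, "B": 400, "C": 900, "D": 50}  # invented counts

scores = pagerank(links)
print(sorted(scores, key=scores.get, reverse=True))
print(spearman(scores, page_views), kendall(scores, page_views))
```

Setting every weight to 1.0 recovers the unweighted case, so comparing the two runs against the same page-view ranking shows how much a given weighting scheme shifts the correlations. For real data with tied scores, a tie-aware correlation routine would be needed instead of these helpers.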
Pages: 227-240
Number of pages: 14