PageRank on Wikipedia: Towards General Importance Scores for Entities

被引:21
|
作者
Thalhammer, Andreas [1 ]
Rettinger, Achim [1 ]
机构
[1] Karlsruhe Inst Technol, AIFB, Karlsruhe, Germany
来源
SEMANTIC WEB, ESWC 2016 | 2016年 / 9989卷
关键词
Wikipedia; DBpedia; PageRank; Link analysis; Page views; Rank correlation;
D O I
10.1007/978-3-319-47602-5_41
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Link analysis methods are used to estimate importance in graph-structured data. In that realm, the PageRank algorithm has been used to analyze directed graphs, in particular the link structure of the Web. Recent developments in information retrieval focus on entities and their relations (i.e., knowledge graph panels). Many entities are documented in the popular knowledge base Wikipedia. The cross-references within Wikipedia exhibit a directed graph structure that is suitable for computing PageRank scores as importance indicators for entities. In this work, we present different PageRank-based analyses on the link graph of Wikipedia and according experiments. We focus on the question whether some links-based on their context/position in the article text-can be deemed more important than others. In our variants, we change the probabilistic impact of links in accordance to their context/position on the page and measure the effects on the output of the PageRank algorithm. We compare the resulting rankings and those of existing systems with page-view-based rankings and provide statistics on the pairwise computed Spearman and Kendall rank correlations.
引用
收藏
页码:227 / 240
页数:14
相关论文
共 50 条
  • [1] Discovering Missing Semantic Relations between Entities in Wikipedia
    Xu, Mengling
    Wang, Zhichun
    Bie, Rongfang
    Li, Juanzi
    Zheng, Chen
    Ke, Wantian
    Zhou, Mingquan
    SEMANTIC WEB - ISWC 2013, PART I, 2013, 8218 : 673 - 686
  • [2] Linking, Searching, and Visualizing Entities in Wikipedia
    Klang, Marcus
    Nugues, Pierre
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2018), 2018, : 3426 - 3432
  • [3] Matching named entities with the aid of Wikipedia
    Bawakid, Abdullah
    Oussalah, Mourad
    Afzal, Naveed
    Shim, Seong
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2015, 23 (04) : 1051 - 1068
  • [4] Vector Embedding of Wikipedia Concepts and Entities
    Sherkat, Ehsan
    Milios, Evangelos E.
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 418 - 428
  • [5] Collective Annotation of Wikipedia Entities in Web Text
    Kulkarni, Sayali
    Singh, Amit
    Ramakrishnan, Ganesh
    Chakrabarti, Soumen
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 457 - 465
  • [6] Quality and Importance of Wikipedia Articles in Different Languages
    Lewoniewski, Wlodzimierz
    Wecel, Krzysztof
    Abramowicz, Witold
    INFORMATION AND SOFTWARE TECHNOLOGIES, ICIST 2016, 2016, 639 : 613 - 624
  • [7] Detection and Graphical Visualization of Relationships between Entities in Wikipedia
    Schmidt, Andreas
    PROCEEDINGS OF THE 2017 7TH INTERNATIONAL CONFERENCE INTERNET TECHNOLOGIES AND APPLICATIONS (ITA), 2017, : 24 - 28
  • [8] Wikipedia Entry Augmentation by Sub-merging Entities Based on Multilingual Ontology
    Ankon, Md Tasnim Manzur
    Ali, Muhammad Masroor
    2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT), 2017,
  • [9] Swat: A system for detecting salient Wikipedia entities in texts
    Ponza, Marco
    Ferragina, Paolo
    Piccinno, Francesco
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (04) : 858 - 890
  • [10] Building The Indonesian NE Dataset Using Wikipedia and DBpedia with Entities Expansion Method on DBpedia
    Alfarohmi, Haji Dito Murya
    Bijaksana, Moch. Arif
    2018 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2018, : 334 - 339