A Large-Scale Characterization of How Readers Browse Wikipedia

被引:5
作者
Piccardi, Tiziano [1 ]
Gerlach, Martin [2 ]
Arora, Akhil [1 ]
West, Robert [1 ]
机构
[1] Ecole Polytech Fed Lausanne, CH-1015 Lausanne, Switzerland
[2] Wikimedia Fdn, 120 Kearny St, San Francisco, CA 94104 USA
基金
欧盟地平线“2020”; 瑞士国家科学基金会; 美国国家科学基金会;
关键词
Wikipedia; web navigation; server logs; information needs; WEB NAVIGATION; INFORMATION BEHAVIOR; USABILITY; SEARCH; MODEL; USER;
D O I
10.1145/3580318
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Despite the importance and pervasiveness of Wikipedia as one of the largest platforms for open knowledge, surprisingly little is known about how people navigate its content when seeking information. To bridge this gap, we present the first systematic large-scale analysis of how readers browse Wikipedia. Using billions of page requests from Wikipedia's server logs, we measure how readers reach articles, how they transition between articles, and how these patterns combine into more complex navigation paths. We find that navigation behavior is characterized by highly diverse structures. Although most navigation paths are shallow, comprising a single pageload, there is much variety, and the depth and shape of paths vary systematically with topic, device type, and time of day. We show that Wikipedia navigation paths commonly mesh with external pages as part of a larger online ecosystem, and we describe how naturally occurring navigation paths are distinct from targeted navigation in lab-based settings. Our results further suggest that navigation is abandoned when readers reach low-quality pages. Taken together, these insights contribute to a more systematic understanding of readers' information needs and allow for improving their experience on Wikipedia and the Web in general.
引用
收藏
页数:22
相关论文
共 87 条
[1]  
AaronHalfaker R., 2019, P HUMANCOMPUTER INTE
[2]   The Dynamics of Repeat Consumption [J].
Anderson, Ashton ;
Kumar, Ravi ;
Tomkins, Andrew ;
Vassilvitskii, Sergei .
WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, :419-429
[3]  
Andreescu Dan, 2021, SEARCHING WIKIPEDIA
[4]  
[Anonymous], 1945, Atlantic, DOI DOI 10.1145/227181.227186
[5]  
[Anonymous], 2001, P 24 ANN INT ACM SIG, DOI DOI 10.1145/383952.383991
[6]  
[Anonymous], 2001, P SIGCHI C HUM FACT, DOI [10.1145/365024.365325, DOI 10.1145/365024.365325]
[7]  
[Anonymous], 2014, Medium -Term Plan for the Period From 2015 to 2019
[8]   Wikipedia Reader Navigation: When Synthetic Data Is Enough [J].
Arora, Akhil ;
Gerlach, Martin ;
Piccardi, Tiziano ;
Garcia-Duran, Alberto ;
West, Robert .
WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, :16-26
[9]   Web navigation prediction using multiple evidence combination and domain knowledge [J].
Awad, Mamoun A. ;
Khan, Latifur R. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2007, 37 (06) :1054-1062
[10]   THE DESIGN OF BROWSING AND BERRYPICKING TECHNIQUES FOR THE ONLINE SEARCH INTERFACE [J].
BATES, MJ .
ONLINE REVIEW, 1989, 13 (05) :407-424