Analysis of Web Browsing Data: A Guide

Cited by: 5
Authors
von Hohenberg, Bernhard Clemm [1, 10]
Stier, Sebastian [2, 8]
Cardenal, Ana S. [3]
Guess, Andrew M. [4, 5]
Menchen-Trevino, Ericka [6]
Wojcieszak, Magdalena [7, 9]
Affiliations
[1] GESIS Leibniz Inst Social Sci, Cologne, Germany
[2] GESIS Leibniz Inst Social Sci, Computat Social Sci Dept, Cologne, Germany
[3] Univ Oberta Catalunya, Barcelona, Spain
[4] Princeton Univ, Polit & Publ Affairs, Princeton, NJ USA
[5] Princeton Univ, Ctr Informat Technol Policy, Princeton, NJ USA
[6] Amer Univ, Washington, DC USA
[7] Univ Calif Davis, Davis, CA USA
[8] Univ Mannheim, Sch Social Sci, Mannheim, Germany
[9] Univ Amsterdam, Amsterdam Sch Commun Res, Amsterdam, Netherlands
[10] GESIS Leibniz Inst Social Sciences, Dept Computat Social Sci, D-50667 Cologne, Germany
Funding
European Research Council
Keywords
web browsing data; digital trace data; web tracking data; computational social science; ONLINE; NEWS;
DOI
10.1177/08944393241227868
Chinese Library Classification
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
The use of individual-level browsing data, that is, the records of a person's visits to online content through a desktop or mobile browser, is of increasing importance for social scientists. Browsing data have characteristics that raise many questions for statistical analysis, yet to date, little hands-on guidance on how to handle them exists. Reviewing extant research, and exploring data sets collected by our four research teams spanning seven countries and several years, with over 14,000 participants and 360 million web visits, we derive recommendations along four steps: preprocessing the raw data; filtering out observations; classifying web visits; and modelling browsing behavior. The recommendations we formulate aim to foster best practices in the field, which so far has paid little attention to justifying the many decisions researchers need to take when analyzing web browsing data.
Pages: 1479-1504
Page count: 26