Analysis of Web Browsing Data: A Guide

被引:5
|
作者
von Hohenberg, Bernhard Clemm [1 ,10 ]
Stier, Sebastian [2 ,8 ]
Cardenal, Ana S. [3 ]
Guess, Andrew M. [4 ,5 ]
Menchen-Trevino, Ericka [6 ]
Wojcieszak, Magdalena [7 ,9 ]
机构
[1] GESIS Leibniz Inst Social Sci, Cologne, Germany
[2] GESIS Leibniz Inst Social Sci, Computat Social Sci Dept, Cologne, Germany
[3] Univ Oberta Catalunya, Barcelona, Spain
[4] Princeton Univ, Polit & Publ Affairs, Princeton, NJ USA
[5] Princeton Univ, Ctr Informat Technol Policy, Princeton, NJ USA
[6] Amer Univ, Washington, DC USA
[7] Univ Calif Davis, Davis, CA USA
[8] Univ Mannheim, Sch Social Sci, Mannheim, Germany
[9] Univ Amsterdam, Amsterdam Sch Commun Res, Amsterdam, Netherlands
[10] GESIS Leibniz Inst SocialSciences, Dept Computat Social Sci, D-50667 Cologne, Germany
基金
欧洲研究理事会;
关键词
web browsing data; digital trace data; web tracking data; computational social science; ONLINE; NEWS;
D O I
10.1177/08944393241227868
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The use of individual-level browsing data, that is, the records of a person's visits to online content through a desktop or mobile browser, is of increasing importance for social scientists. Browsing data have characteristics that raise many questions for statistical analysis, yet to date, little hands-on guidance on how to handle them exists. Reviewing extant research, and exploring data sets collected by our four research teams spanning seven countries and several years, with over 14,000 participants and 360 million web visits, we derive recommendations along four steps: preprocessing the raw data; filtering out observations; classifying web visits; and modelling browsing behavior. The recommendations we formulate aim to foster best practices in the field, which so far has paid little attention to justifying the many decisions researchers need to take when analyzing web browsing data.
引用
收藏
页码:1479 / 1504
页数:26
相关论文
共 50 条
  • [41] Data Integrity Issues With Web-Based Studies:An Institutional Example of a Widespread Challenge
    French, Blandine
    Babbage, Camilla
    Bird, Katherine
    Marsh, Lauren
    Pelton, Mirabel
    Patel, Shireen
    Cassidy, Sarah
    Rennick-Egglestone, Stefan
    JMIR MENTAL HEALTH, 2024, 11
  • [42] Uncovering Specific Navigation Patterns by Assessing User Engagement of People With Dementia and Family Caregivers With an Advance Care Planning Website: Quantitative Analysis of Web Log Data
    Dupont, Charless
    Smets, Tinne
    Potts, Courtney
    Monnet, Fanny
    Pivodic, Lara
    De Vleminck, Aline
    Van Audenhove, Chantal
    Mulvenna, Maurice
    van den Block, Lieve
    JMIR AGING, 2025, 8
  • [43] Using lexicometry and vocabulary analysis techniques to detect a signature for web profile
    El Bouanani, Sara El Manar
    Kassou, Ismail
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 1494 - 1498
  • [44] Integrated visual analysis of patterns in time series and text data - Workflow and application to financial data analysis
    Wanner, Franz
    Jentner, Wolfgang
    Schreck, Tobias
    Stoffel, Andreas
    Sharalieva, Lyubka
    Keim, Daniel A.
    INFORMATION VISUALIZATION, 2016, 15 (01) : 75 - 90
  • [45] The Dark Web and cannabis use in the United States: Evidence from a big data research design
    Jardine, Eric
    Lindner, Andrew M.
    INTERNATIONAL JOURNAL OF DRUG POLICY, 2020, 76
  • [46] Sources and data for social network analysis
    Bes, Marie-Pierre
    Favre, Guillaume
    Lemercier, Claire
    BMS-BULLETIN OF SOCIOLOGICAL METHODOLOGY-BULLETIN DE METHODOLOGIE SOCIOLOGIQUE, 2021, 152 (01): : 10 - 51
  • [47] Content Characteristics and Transmission Strategies of Social Media Rumors in China: Big Data Analysis of WeChat Rumors
    He, Lingnan
    Gu, Jing
    Li, Dan
    Lai, Kaisheng
    2019 6TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC AND SOCIO-CULTURAL COMPUTING (BESC 2019), 2019,
  • [48] Preventing and Protecting Against Internet Research Fraud in Anonymous Web-Based Research: Protocol for the Development and Implementation of an Anonymous Web-Based Data Integrity Plan
    Hohn, Kris L.
    Braswell, April A.
    DeVita, James M.
    JMIR RESEARCH PROTOCOLS, 2022, 11 (09):
  • [49] A new culture of advocacy: An exploratory analysis of social activism on the web and social media
    Seelig, Michelle, I
    Millette, Diane
    Zhou, Chun
    Huang, Jialing
    ATLANTIC JOURNAL OF COMMUNICATION, 2019, 27 (01) : 15 - 29
  • [50] Analysis of student activity in web-supported courses as a tool for predicting dropout
    Cohen, Anat
    ETR&D-EDUCATIONAL TECHNOLOGY RESEARCH AND DEVELOPMENT, 2017, 65 (05): : 1285 - 1304