Predicting Web Survey Breakoffs Using Machine Learning Models

被引:0
|
作者
Chen, Zeming [1 ]
Cernat, Alexandru [2 ]
Shlomo, Natalie [2 ]
机构
[1] Univ Manchester, Social Stat Dept, Manchester, Lancs, England
[2] Univ Manchester, Social Stat Dept, Social Stat, Manchester, Lancs, England
关键词
breakoff timing; time-varying variables; Cox model; LASSO Cox model; logistic regression; random forest; gradient boosting; support vector machine; RATES; TREE;
D O I
10.1177/08944393221112000
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Web surveys are becoming increasingly popular but tend to have more breakoffs compared to the interviewer-administered surveys. Survey breakoffs occur when respondents quit the survey partway through. The Cox survival model is commonly used to understand patterns of breakoffs. Nevertheless, there is a trend to using more data-driven models when the purpose is prediction, such as classification machine learning models. It is unclear in the breakoff literature what are the best statistical models for predicting question-level breakoffs. Additionally, there is no consensus about the treatment of time-varying question-level predictors, such as question response time and question word count. While some researchers use the current values, others aggregate the value from the beginning of the survey. This study develops and compares both survival models and classification models along with different treatments of time-varying variables. Based on the level of agreement between the predicted and actual breakoff, we find that the Cox model and gradient boosting outperform other survival models and classification models respectively. We also find that using the values of time-varying predictors concurrent to the breakoff status is more predictive of breakoff, compared to aggregating their values from the beginning of the survey, implying that respondents' breakoff behaviour is more driven by the current response burden.
引用
收藏
页码:573 / 591
页数:19
相关论文
共 50 条
  • [41] Predicting the Air Quality Using Machine Learning Algorithms: A Comparative Study
    Goel, Neetika
    Kumari, Ritika
    Bansal, Poonam
    SMART TRENDS IN COMPUTING AND COMMUNICATIONS, VOL 1, SMARTCOM 2024, 2024, 945 : 137 - 147
  • [42] Severity modeling of work zone crashes in New Jersey using machine learning models
    Hasan, Ahmed Sajid
    Kabir, Md Asif Bin
    Jalayer, Mohammad
    Das, Subasish
    JOURNAL OF TRANSPORTATION SAFETY & SECURITY, 2023, 15 (06) : 604 - 635
  • [43] Machine Learning Models for Predicting Neonatal Mortality: A Systematic Review
    Mangold, Cheyenne
    Zoretic, Sarah
    Thallapureddy, Keerthi
    Moreira, Axel
    Chorath, Kevin
    Moreira, Alvaro
    NEONATOLOGY, 2021, 118 (04) : 394 - 405
  • [44] Machine learning models for predicting survival in patients with ampullary adenocarcinoma
    Huang, Tao
    Huang, Liying
    Yang, Rui
    Li, Shuna
    He, Ningxia
    Feng, Aozi
    Li, Li
    Lyu, Jun
    ASIA-PACIFIC JOURNAL OF ONCOLOGY NURSING, 2022, 9 (12)
  • [45] Landslide susceptibility assessment using feature selection-based machine learning models
    Liu, Lei-Lei
    Yang, Can
    Wang, Xiao-Mi
    GEOMECHANICS AND ENGINEERING, 2021, 25 (01) : 1 - 16
  • [46] Predicting tool life and sound pressure levels in dry turning using machine learning models
    de Souza, Alex Fernandes
    Verri, Filipe Alves Neto
    Campos, Paulo Henrique da Silva
    Balestrassi, Pedro Paulo
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2024, 135 (7-8) : 3777 - 3793
  • [47] Predicting detonation cell size of biogas-oxygen mixtures using machine learning models
    Siatkowski, S.
    Wacko, K.
    Kindracki, J.
    SHOCK WAVES, 2024, 34 (02) : 129 - 137
  • [48] Predicting maturity and identifying key factors in organic waste composting using machine learning models
    Wang, Ning
    Yang, Wanli
    Wang, Bingshu
    Bai, Xinyue
    Wang, Xinwei
    Xu, Qiyong
    BIORESOURCE TECHNOLOGY, 2024, 400
  • [49] MACHINE LEARNING FOR PREDICTING OUTCOMES IN TRAUMA
    Liu, Nehemiah T.
    Salinas, Jose
    SHOCK, 2017, 48 (05): : 504 - 510
  • [50] Assessment of Landslide Susceptibility using Geospatial Techniques: A Comparative Evaluation of Machine Learning and Statistical Models
    Raut, Subrata
    Dutta, Dipanwita
    Bera, Debarati
    Samanta, Rajeeb
    GEOLOGICAL JOURNAL, 2024, : 1129 - 1149