Leveraging Machine Learning to Develop Digital Engagement Phenotypes of Users in a Digital Diabetes Prevention Program: Evaluation Study

被引:1
作者
Rodriguez, Danissa, V [1 ]
Chen, Ji [1 ]
Viswanadham, Ratnalekha V. N. [1 ]
Lawrence, Katharine [1 ,2 ]
Mann, Devin [1 ,2 ]
机构
[1] NYU, Grosman Sch Med, 227 E 30th St,6th Floor, New York, NY 10016 USA
[2] NYU Langone Hlth, New York, NY USA
来源
JMIR AI | 2024年 / 3卷
关键词
ADHERENCE; FUTURE;
D O I
10.2196/47122
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Digital diabetes prevention programs (dDPPs) are effective "digital prescriptions" but have high attrition rates and program noncompletion. To address this, we developed a personalized automatic messaging system (PAMS) that leverages SMS text messaging and data integration into clinical workflows to increase dDPP engagement via enhanced patient-provider communication. Preliminary data showed positive results. However, further investigation is needed to determine how to optimize the tailoring of support technology such as PAMS based on a user's preferences to boost their dDPP engagement. Objective: This study evaluates leveraging machine learning (ML) to develop digital engagement phenotypes of dDPP users and assess ML's accuracy in predicting engagement with dDPP activities. This research will be used in a PAMS optimization process to improve PAMS personalization by incorporating engagement prediction and digital phenotyping. This study aims (1) to prove the feasibility of using dDPP user-collected data to build an ML model that predicts engagement and contributes to identifying digital engagement phenotypes, (2) to describe methods for developing ML models with dDPP data sets and present preliminary results, and (3) to present preliminary data on user profiling based on ML model outputs. Methods: Using the gradient-boosted forest model, we predicted engagement in 4 dDPP individual activities (physical activity, lessons, social activity, and weigh-ins) and general activity (engagement in any activity) based on previous short- and long-term activity in the app. The area under the receiver operating characteristic curve, the area under the precision-recall curve, and the Brier score metrics determined the performance of the model. Shapley values reflected the feature importance of the models and determined what variables informed user profiling through latent profile analysis. Results: We developed 2 models using weekly and daily DPP data sets (328,821 and 704,242 records, respectively), which yielded predictive accuracies above 90%. Although both models were highly accurate, the daily model better fitted our research plan because it predicted daily changes in individual activities, which was crucial for creating the "digital phenotypes." To better understand the variables contributing to the model predictor, we calculated the Shapley values for both models to identify the features with the highest contribution to model fit; engagement with any activity in the dDPP in the last 7 days had the most predictive power. We profiled users with latent profile analysis after 2 weeks of engagement (Bayesian information criterion=-3222.46) with the dDPP and identified 6 profiles of users, including those with high engagement, minimal engagement, Conclusions: Preliminary results demonstrate that applying ML methods with predicting power is an acceptable mechanism to tailor and optimize messaging interventions to support patient engagement and adherence to digital prescriptions. The results enable future optimization of our existing messaging platform and expansion of this methodology to other clinical domains.
引用
收藏
页数:13
相关论文
共 25 条
[1]   The Effectiveness of Technology-Based Strategies to Promote Engagement With Digital Interventions: A Systematic Review Protocol [J].
Alkhaldi, Ghadah ;
Hamilton, Fiona L. ;
Lau, Rosa ;
Webster, Rosie ;
Michie, Susan ;
Murray, Elizabeth .
JMIR RESEARCH PROTOCOLS, 2015, 4 (02)
[2]  
[Anonymous], 2017, Making sense of evaluation: A handbook for everyone
[3]  
Bohlmann Aaron, 2021, JMIRx Med, V2, pe26993, DOI 10.2196/26993
[4]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[5]  
Dimitrov DV, 2016, HEALTHC INFORM RES, V22, P156
[6]   Predicting therapy outcome in a digital mental health intervention for depression and anxiety: A machine learning approach [J].
Hornstein, Silvan ;
Forman-Hoffman, Valerie ;
Nazander, Albert ;
Ranta, Kristian ;
Hilbert, Kevin .
DIGITAL HEALTH, 2021, 7
[7]  
Horrigan JB., 2015, Pew Research Center, P21
[8]   Medicine 2032: The future of cardiovascular disease prevention with machine learning and digital health technology [J].
Javaid, Aamir ;
Zghyer, Fawzi ;
Kim, Chang ;
Spaulding, Erin M. ;
Isakadze, Nino ;
Ding, Jie ;
Kargillis, Daniel ;
Gao, Yumin ;
Rahman, Faisal ;
Brown, Donald E. ;
Saria, Suchi ;
Martin, Seth S. ;
Kramer, Christopher M. ;
Blumenthal, Roger S. ;
Marvel, Francoise A. .
AMERICAN JOURNAL OF PREVENTIVE CARDIOLOGY, 2022, 12
[9]   Online Prevention Aimed at Lifestyle Behaviors: A Systematic Review of Reviews [J].
Kohl, Leonie Fm ;
Crutzen, Rik ;
de Vries, Nanne K. .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2013, 15 (07) :71-83
[10]   Conversational agents in healthcare: a systematic review [J].
Laranjo, Liliana ;
Dunn, Adam G. ;
Tong, Huong Ly ;
Kocaballi, Ahmet Baki ;
Chen, Jessica ;
Bashir, Rabia ;
Surian, Didi ;
Gallego, Blanca ;
Magrabi, Farah ;
Lau, Annie Y. S. ;
Coiera, Enrico .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (09) :1248-1258