Identifying Young Adults at High Risk for Weight Gain Using Machine Learning

被引:1
作者
Murtha, Jacqueline A. [1 ]
Birstler, Jen [2 ]
Stalter, Lily [1 ]
Jawara, Dawda [1 ]
Hanlon, Bret M. [1 ,2 ]
Hanrahan, Lawrence P. [3 ]
Churpek, Matthew M. [2 ,4 ]
Funk, Luke M. [1 ,5 ,6 ]
机构
[1] Univ Wisconsin, Dept Surg, Madison, WI USA
[2] Univ Wisconsin, Dept Biostat & Med Informat, Madison, WI USA
[3] Univ Wisconsin, Sch Med & Publ Hlth, Dept Family Med & Community Hlth, Madison, WI USA
[4] Univ Wisconsin, Dept Med, Madison, WI USA
[5] William S Middleton Mem VA, Dept Surg, Madison, WI USA
[6] Univ Wisconsin, Dept Surg, 600 Highland Ave,H4-728 Clin Sci Ctr, Madison, WI 53792 USA
关键词
Adult; Machine learning; Obesity; Weight gain; Young; UNITED-STATES; BODY-MASS; OBESITY; MORTALITY; PREVALENCE; GUIDELINES; OVERWEIGHT; CHILDHOOD; HEALTH; MODEL;
D O I
10.1016/j.jss.2023.05.015
中图分类号
R61 [外科手术学];
学科分类号
摘要
Introduction: Weight gain among young adults continues to increase. Identifying adults at high risk for weight gain and intervening before they gain weight could have a major public health impact. Our objective was to develop and test electronic health record-based machine learning models to predict weight gain in young adults with overweight/class 1 obesity.Methods: Seven machine learning models were assessed, including three regression models, random forest, single-layer neural network, gradient-boosted decision trees, and support vector machine (SVM) models. Four categories of predictors were included: 1) demographics; 2) obesity-related health conditions; 3) laboratory data and vital signs; and 4) neighborhood-level variables. The cohort was split 60:40 for model training and validation. Area under the receiver operating characteristic curves (AUC) were calculated to determine model accuracy at predicting high-risk individuals, defined by > 10% total body weight gain within 2 y. Variable importance was measured via generalized analysis of variance procedures.Results: Of the 24,183 patients (mean [SD] age, 32.0 [6.3] y; 55.1% females) in the study, 14.2% gained >10% total body weight. Area under the receiver operating characteristic curves varied from 0.557 (SVM) to 0.675 (gradient-boosted decision trees). Age, sex, and baseline body mass index were the most important predictors among the models except SVM and neural network.Conclusions: Our machine learning models performed similarly and had modest accuracy for identifying young adults at risk of weight gain. Future models may need to incorporate behavioral and/or genetic information to enhance model accuracy.Published by Elsevier Inc.
引用
收藏
页码:7 / 16
页数:10
相关论文
共 50 条
[41]  
U.S. Department of Veterans Affairs, 2022, OFF RES DEV MILL VET
[42]  
United States Department of Agriculture Economic Research Service, 2020, RUR URB COMM AR COD
[43]  
United States Environmental Protection Agency, WALK IND
[44]  
United Status Census Bureau, 2022, GLOSS CENS BLOCK GRO
[45]   What is the relationship between risk factor reduction and degree of weight loss? [J].
Van Gaal, LF ;
Mertens, IL ;
Ballaux, D .
EUROPEAN HEART JOURNAL SUPPLEMENTS, 2005, 7 (0L) :L21-L26
[46]  
von Elm E, 2008, J CLIN EPIDEMIOL, V61, P344, DOI [10.1016/j.jclinepi.2007.11.008, 10.2471/BLT.07.045120]
[47]   Can machine-learning improve cardiovascular risk prediction using routine clinical data? [J].
Weng, Stephen F. ;
Reps, Jenna ;
Kai, Joe ;
Garibaldi, Jonathan M. ;
Qureshi, Nadeem .
PLOS ONE, 2017, 12 (04)
[48]  
Williamson BD, 2021, BIOMETRICS, V77, P9, DOI 10.1111/biom.13392
[49]   Study of cardiovascular disease prediction model based on random forest in eastern China [J].
Yang, Li ;
Wu, Haibin ;
Jin, Xiaoqing ;
Zheng, Pinpin ;
Hu, Shiyun ;
Xu, Xiaoling ;
Yu, Wei ;
Yan, Jing .
SCIENTIFIC REPORTS, 2020, 10 (01)
[50]  
Zheng Z, 2017, 2017 IEEE INT C BIOI