Study of cardiovascular disease prediction model based on random forest in eastern China

被引:223
作者
Yang, Li [1 ,4 ]
Wu, Haibin [2 ]
Jin, Xiaoqing [3 ]
Zheng, Pinpin [4 ]
Hu, Shiyun [1 ]
Xu, Xiaoling [1 ]
Yu, Wei [1 ]
Yan, Jing [1 ]
机构
[1] Zhejiang Hosp, Zhejiang Prov Ctr Cardiovasc Dis Control & Preven, Hangzhou 310013, Peoples R China
[2] Ewell Technol Co Ltd, Tower Oriental Commun Technol City, Hangzhou 310000, Peoples R China
[3] Zhejiang Hosp, Chinese Acupuncture Dept, Hangzhou 310013, Peoples R China
[4] Fudan Univ, Key Lab Publ Hlth Safety, Minist Educ, Hlth Commun Inst, 138 Yixueyuan Rd, Shanghai 200032, Peoples R China
基金
美国国家科学基金会;
关键词
RISK; VALIDATION;
D O I
10.1038/s41598-020-62133-5
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Cardiovascular disease (CVD) is the leading cause of death worldwide and a major public health concern. CVD prediction is one of the most effective measures for CVD control. In this study, 29930 subjects with high-risk of CVD were selected from 101056 people in 2014, regular follow-up was conducted using electronic health record system. Logistic regression analysis showed that nearly 30 indicators were related to CVD, including male, old age, family income, smoking, drinking, obesity, excessive waist circumference, abnormal cholesterol, abnormal low-density lipoprotein, abnormal fasting blood glucose and else. Several methods were used to build prediction model including multivariate regression model, classification and regression tree (CART), Naive Bayes, Bagged trees, Ada Boost and Random Forest. We used the multivariate regression model as a benchmark for performance evaluation (Area under the curve, AUC = 0.7143). The results showed that the Random Forest was superior to other methods with an AUC of 0.787 and achieved a significant improvement over the benchmark. We provided a CVD prediction model for 3-year risk assessment of CVD. It was based on a large population with high risk of CVD in eastern China using Random Forest algorithm, which would provide reference for the work of CVD prediction and treatment in China.
引用
收藏
页数:8
相关论文
共 29 条
[1]   Cardiovascular disease risk prediction using automated machine learning: A prospective study of 423,604 UK Biobank participants [J].
Alaa, Ahmed M. ;
Bolton, Thomas ;
Di Angelantonio, Emanuele ;
Rudd, James H. F. ;
van der Schaar, Mihaela .
PLOS ONE, 2019, 14 (05)
[2]   Discrimination and Calibration of Clinical Prediction Models Users' Guides to the Medical Literature [J].
Alba, Ana Carolina ;
Agoritsas, Thomas ;
Walsh, Michael ;
Hanna, Steven ;
Iorio, Alfonso ;
Devereaux, P. J. ;
McGinn, Thomas ;
Guyatt, Gordon .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (14) :1377-1384
[3]  
[Anonymous], 2012, BMJ BRIT MED J, DOI DOI 10.1136/BMJ.E3318
[4]  
[Anonymous], 2019, CHINESE J PREVENTIVE, V53, P13
[5]  
[Anonymous], 3 REP EXP PAN DET EV
[6]  
[Anonymous], 2007, ZHONGHUA XINXUEGUANB
[7]   Implications of scaling up cardiovascular disease treatment in South Africa: a microsimulation and cost-effectiveness analysis [J].
Basu, Sanjay ;
Wagner, Ryan G. ;
Sewpaul, Ronel ;
Reddy, Priscilla ;
Davies, Justine .
LANCET GLOBAL HEALTH, 2019, 7 (02) :E270-E280
[8]   Prediction of Cardiovascular Risk to Guide Primary Prevention [J].
Curfman, Gregory D. .
JAMA INTERNAL MEDICINE, 2018, 178 (09) :1240-1241
[9]   Contemporary cardiovascular risk prediction [J].
Damen, Johanna A. A. G. ;
Hooft, Lotty ;
Moons, Karel G. M. .
LANCET, 2018, 391 (10133) :1867-1868
[10]   Prediction models for cardiovascular disease risk in the general population: systematic review [J].
Damen, Johanna A. A. G. ;
Hooft, Lotty ;
Schuit, Ewoud ;
Debray, Thomas P. A. ;
Collins, Gary S. ;
Tzoulaki, Ioanna ;
Lassale, Camille M. ;
Siontis, George C. M. ;
Chiocchia, Virginia ;
Roberts, Corran ;
Schlussel, Michael Maia ;
Gerry, Stephen ;
Black, James A. ;
Heus, Pauline ;
van der Schouw, Yvonne T. ;
Peelen, Linda M. ;
Moons, Karel G. M. .
BMJ-BRITISH MEDICAL JOURNAL, 2016, 353