Development of machine learning prediction models to explore nutrients predictive of cardiovascular disease using Canadian linked population-based data

被引:11
作者
Morgenstern, Jason D. [1 ]
Rosella, Laura C. [2 ,3 ,4 ]
Costa, Andrew P. [1 ,3 ,5 ]
Anderson, Laura N. [1 ,6 ]
机构
[1] McMaster Univ, Dept Hlth Res Methods Evidence & Impact, Hamilton, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
[3] Inst Clin Evaluat Sci ICES, Toronto, ON, Canada
[4] Vector Inst, Toronto, ON, Canada
[5] McMaster Univ, Dept Med, Hamilton, ON, Canada
[6] Hamilton Hlth Sci, Populat Hlth Res Inst, Hamilton, ON, Canada
基金
加拿大健康研究院;
关键词
machine learning; cardiovascular disease; heart disease; stroke; 24-hour dietary recall; Canadian; population health; nutritional epidemiology; artificial intelligence; DIETARY FACTORS; HEART-DISEASE; ALL-CAUSE; BIG DATA; MORTALITY; SODIUM; HEALTH; RISK; ASSOCIATIONS; METAANALYSIS;
D O I
10.1139/apnm-2021-0502
中图分类号
R15 [营养卫生、食品卫生]; TS201 [基础科学];
学科分类号
100403 ;
摘要
Machine learning may improve use of observational data to understand the nutritional epidemiology of cardiovascular disease (CVD) through better modelling of non-linearity, non-additivity, and dietary complexity. Our objective was to develop machine learning prediction models for exploring how nutrients are related to CVD risk and to evaluate their predictive performance. We established a population-based cohort from the Canadian Community Health Survey and measured CVD incidence and mortality from 2004 to 2018 using administrative databases of national hospital discharges and deaths. Predictors included 61 nutrition variables and fourteen socioeconomic, demographic, psychological, and behavioural variables. Conditional inference forest models were interpreted and evaluated by permutation feature importance, accumulated local effects, and predictive discrimination and calibration. A total of 12 130 individuals were included in the study. Use of supplements, caffeine, and alcohol were the most important nutrition variables for prediction of CVD. Supplement use was associated with decreased risk, caffeine was associated with increasing risk, and alcohol had a u-shaped association with risk. The model had an out-of-sample c-statistic of 0.821 (95% confidence interval = 0.801???0.842). Exploratory findings included both known and novel associations and predictive performance was competitive, suggesting that further application of machine learning to nutritional epidemiology may help elucidate risks and improve predictive models.
引用
收藏
页码:529 / 546
页数:18
相关论文
共 82 条
[1]   Polyunsaturated fatty acids for the primary and secondary prevention of cardiovascular disease [J].
Abdelhamid, Asmaa S. ;
Martin, Nicole ;
Bridges, Charlene ;
Brainard, Julii S. ;
Wang, Xia ;
Brown, Tracey J. ;
Hanson, Sarah ;
Jimoh, Oluseyi F. ;
Ajabnoor, Sarah M. ;
Deane, Katherine H. O. ;
Song, Fujian ;
Hooper, Lee .
COCHRANE DATABASE OF SYSTEMATIC REVIEWS, 2018, (07)
[2]   Health effects of dietary risks in 195 countries, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017 [J].
Afshin, Ashkan ;
Sur, Patrick John ;
Fay, Kairsten A. ;
Cornaby, Leslie ;
Ferrara, Giannina ;
Salama, Joseph S. ;
Mullany, Erin C. ;
Abate, Kalkidan Hassen ;
Abbafati, Cristiana ;
Abebe, Zegeye ;
Afarideh, Mohsen ;
Aggarwal, Anju ;
Agrawal, Sutapa ;
Akinyemiju, Tomi ;
Alahdab, Fares ;
Bacha, Umar ;
Bachman, Victoria F. ;
Badali, Hamid ;
Badawi, Alaa ;
Bensenor, Isabela M. ;
Bernabe, Eduardo ;
Biryukov, Stan H. ;
Biadgilign, Sibhatu Kassa K. ;
Cahill, Leah E. ;
Carrero, Juan J. ;
Cercy, Kelly M. ;
Dandona, Lalit ;
Dandona, Rakhi ;
Anh Kim Dang ;
Degefa, Meaza Girma ;
Zaki, Maysaa El Sayed ;
Esteghamati, Alireza ;
Esteghamati, Sadaf ;
Fanzo, Jessica ;
Farinha, Carla Sofia E. Sa ;
Farvid, Maryam S. ;
Farzadfar, Farshad ;
Feigin, Valery L. ;
Fernandes, Joao C. ;
Flor, Luisa Sorio ;
Foigt, Nataliya A. ;
Forouzanfar, Mohammad H. ;
Ganji, Morsaleh ;
Geleijnse, Johanna M. ;
Gillum, Richard F. ;
Goulart, Alessandra C. ;
Grosso, Giuseppe ;
Guessous, Idris ;
Hamidi, Samer ;
Hankey, Graeme J. .
LANCET, 2019, 393 (10184) :1958-1972
[3]   External validation and comparison of four cardiovascular risk prediction models with data from the Australian Diabetes, Obesity and Lifestyle study [J].
Albarqouni, Loai ;
Doust, Jennifer A. ;
Magliano, Dianna ;
Barr, Elizabeth L. M. ;
Shaw, Jonathan E. ;
Glasziou, Paul P. .
MEDICAL JOURNAL OF AUSTRALIA, 2019, 210 (04) :161-167
[4]  
[Anonymous], 2014, Ethical conduct for research involving humans
[5]  
[Anonymous], 2008, CANADIAN COMMUNITY H
[6]  
[Anonymous], 2004, CANADIAN COMMUNITY H
[7]   Fruit and vegetable intake and the risk of cardiovascular disease, total cancer and all-cause mortality-a systematic review and dose-response meta-analysis of prospective studies [J].
Aune, Dagfinn ;
Giovannucci, Edward ;
Boffetta, Paolo ;
Fadnes, Lars T. ;
Keum, NaNa ;
Norat, Teresa ;
Greenwood, Darren C. ;
Riboli, Elio ;
Vatten, Lars J. ;
Tonstad, Serena .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2017, 46 (03) :1029-1056
[8]   The USDA automated multiple-pass method accurately estimates group total energy and nutrient intake [J].
Blanton, Cynthia A. ;
Moshfegh, Alanna J. ;
Baer, David J. ;
Kretsch, Mary J. .
JOURNAL OF NUTRITION, 2006, 136 (10) :2594-2599
[9]   Machine learning as a strategy to account for dietary synergy: an illustration based on dietary intake and adverse pregnancy outcomes [J].
Bodnar, Lisa M. ;
Cartus, Abigail R. ;
Kirkpatrick, Sharon, I ;
Himes, Katherine P. ;
Kennedy, Edward H. ;
Simhan, Hyagriv N. ;
Grobman, William A. ;
Duffy, Jennifer Y. ;
Silver, Robert M. ;
Parry, Samuel ;
Naimi, Ashley, I .
AMERICAN JOURNAL OF CLINICAL NUTRITION, 2020, 111 (06) :1235-1243
[10]   Sugar-Sweetened Beverage, Sugar Intake of Individuals, and Their Blood Pressure International Study of Macro/Micronutrients and Blood Pressure [J].
Brown, Ian J. ;
Stamler, Jeremiah ;
Van Horn, Linda ;
Robertson, Claire E. ;
Chan, Queenie ;
Dyer, Alan R. ;
Huang, Chiang-Ching ;
Rodriguez, Beatriz L. ;
Zhao, Liancheng ;
Daviglus, Martha L. ;
Ueshima, Hirotsugu ;
Elliott, Paul .
HYPERTENSION, 2011, 57 (04) :695-+