Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

被引:24
作者
Flores, Alyssa M. [1 ]
Schuler, Alejandro [2 ]
Eberhard, Anne Verena [1 ]
Olin, Jeffrey W. [3 ]
Cooke, John P. [4 ]
Leeper, Nicholas J. [1 ,5 ,6 ]
Shah, Nigam H. [2 ]
Ross, Elsie G. [1 ,2 ,6 ]
机构
[1] Stanford Univ, Dept Surg, Div Vasc Surg, Sch Med, Stanford, CA 94305 USA
[2] Stanford Univ, Ctr Biomed Informat Res, Stanford, CA 94305 USA
[3] Icahn Sch Med Mt Sinai, Zena & Michael A Wiener Cardiovasc Inst, Marie Josee & Henry R Kravis Ctr Cardiovasc Hlth, New York, NY 10029 USA
[4] Houston Methodist Res Inst, Dept Cardiovasc Sci, Houston, TX USA
[5] Stanford Univ, Div Cardiovasc Med, Dept Med, Sch Med, Stanford, CA 94305 USA
[6] Stanford Cardiovasc Inst, Stanford, CA USA
来源
JOURNAL OF THE AMERICAN HEART ASSOCIATION | 2021年 / 10卷 / 23期
基金
美国国家卫生研究院;
关键词
cluster analysis; coronary artery disease; machine learning; phenotype discovery; ANKLE-BRACHIAL INDEX; MORTALITY RISK PREDICTION; HEART-DISEASE; CARDIOVASCULAR RISK; GENETIC RISK; ASSOCIATION; VALIDATION; PHENOTYPES; METAANALYSIS; OUTCOMES;
D O I
10.1161/JAHA.121.021976
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K--means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all-cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All-cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle-aged/ healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle-aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment.
引用
收藏
页数:33
相关论文
共 45 条
  • [41] Phenomapping for Novel Classification of Heart Failure With Preserved Ejection Fraction
    Shah, Sanjiv J.
    Katz, Daniel H.
    Selvaraj, Senthil
    Burke, Michael A.
    Yancy, Clyde W.
    Gheorghiade, Mihai
    Bonow, Robert O.
    Huang, Chiang-Ching
    Deo, Rahul C.
    [J]. CIRCULATION, 2015, 131 (03) : 269 - +
  • [42] Sirugo G, 2019, CELL, V177, P26, DOI [10.1016/j.cell.2019.02.048, 10.1016/j.cell.2019.04.032]
  • [43] Heterogeneous impact of classic atherosclerotic risk factors on different arterial territories: the EPIC-Norfolk prospective population study
    Stoekenbroek, Robert M.
    Boekholdt, S. Matthijs
    Luben, Robert
    Hovingh, G. Kees
    Zwinderman, Aeilko H.
    Wareham, Nicholas J.
    Khaw, Kay-Tee
    Peters, Ron J. G.
    [J]. EUROPEAN HEART JOURNAL, 2016, 37 (11) : 880 - 889
  • [44] Genetic determinants of the ankle-brachial index: A meta-analysis of a cardiovascular candidate gene 50K SNP panel in the candidate gene association resource (CARe) consortium
    Wassel, Christina L.
    Lamina, Claudia
    Nambi, Vijay
    Coassin, Stefan
    Mukamal, Kenneth J.
    Ganesh, Santhi K.
    Jacobs, David R., Jr.
    Franceschini, Nora
    Papanicolaou, George J.
    Gibson, Quince
    Yanek, Lisa R.
    van der Harst, Pim
    Ferguson, Jane F.
    Crawford, Dana C.
    Waite, Lindsay L.
    Allison, Matthew A.
    Criqui, Michael H.
    McDermott, Mary M.
    Mehra, Reena
    Cupples, L. Adrienne
    Hwang, Shih-Jen
    Redline, Susan
    Kaplan, Robert C.
    Heiss, Gerardo
    Rotter, Jerome I.
    Boerwinkle, Eric
    Taylor, Herman A.
    Eraso, Luis H.
    Haun, Margot
    Li, Mingyao
    Meisinger, Christa
    O'Connell, Jeffrey R.
    Shuldineri, Alan R.
    Tybjaerg-Hansen, Anne
    Frikke-Schmidt, Ruth
    Kollerits, Barbara
    Rantner, Barbara
    Dieplinger, Benjamin
    Stadler, Marietta
    Mueller, Thomas
    Haltmayer, Meinhard
    Klein-Weigel, Peter
    Summerer, Monika
    Wichmann, H. -Erich
    Asselbergs, Folkert W.
    Navis, Gerjan
    Leach, Irene Mateo
    Brown-Gentry, Kristin
    Goodloe, Robert
    Assimes, Themistocles L.
    [J]. ATHEROSCLEROSIS, 2012, 222 (01) : 138 - 147
  • [45] Low lifetime recreational activity is a risk factor for peripheral arterial disease
    Wilson, Andrew M.
    Sadrzadeh-Rafie, Amir H.
    Myers, Jonathan
    Assimes, Themistocles
    Nead, Kevin T.
    Higgins, Mamie
    Gabriel, Andre
    Olin, Jeffrey
    Cooke, John P.
    [J]. JOURNAL OF VASCULAR SURGERY, 2011, 54 (02) : 427 - 432