Unsupervised Learning for Automated Detection of Coronary Artery Disease Subgroups

被引:24
作者
Flores, Alyssa M. [1 ]
Schuler, Alejandro [2 ]
Eberhard, Anne Verena [1 ]
Olin, Jeffrey W. [3 ]
Cooke, John P. [4 ]
Leeper, Nicholas J. [1 ,5 ,6 ]
Shah, Nigam H. [2 ]
Ross, Elsie G. [1 ,2 ,6 ]
机构
[1] Stanford Univ, Dept Surg, Div Vasc Surg, Sch Med, Stanford, CA 94305 USA
[2] Stanford Univ, Ctr Biomed Informat Res, Stanford, CA 94305 USA
[3] Icahn Sch Med Mt Sinai, Zena & Michael A Wiener Cardiovasc Inst, Marie Josee & Henry R Kravis Ctr Cardiovasc Hlth, New York, NY 10029 USA
[4] Houston Methodist Res Inst, Dept Cardiovasc Sci, Houston, TX USA
[5] Stanford Univ, Div Cardiovasc Med, Dept Med, Sch Med, Stanford, CA 94305 USA
[6] Stanford Cardiovasc Inst, Stanford, CA USA
来源
JOURNAL OF THE AMERICAN HEART ASSOCIATION | 2021年 / 10卷 / 23期
基金
美国国家卫生研究院;
关键词
cluster analysis; coronary artery disease; machine learning; phenotype discovery; ANKLE-BRACHIAL INDEX; MORTALITY RISK PREDICTION; HEART-DISEASE; CARDIOVASCULAR RISK; GENETIC RISK; ASSOCIATION; VALIDATION; PHENOTYPES; METAANALYSIS; OUTCOMES;
D O I
10.1161/JAHA.121.021976
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BACKGROUND: The promise of precision population health includes the ability to use robust patient data to tailor prevention and care to specific groups. Advanced analytics may allow for automated detection of clinically informative subgroups that account for clinical, genetic, and environmental variability. This study sought to evaluate whether unsupervised machine learning approaches could interpret heterogeneous and missing clinical data to discover clinically important coronary artery disease subgroups. METHODS AND RESULTS: The Genetic Determinants of Peripheral Arterial Disease study is a prospective cohort that includes individuals with newly diagnosed and/or symptomatic coronary artery disease. We applied generalized low rank modeling and K--means cluster analysis using 155 phenotypic and genetic variables from 1329 participants. Cox proportional hazard models were used to examine associations between clusters and major adverse cardiovascular and cerebrovascular events and all-cause mortality. We then compared performance of risk stratification based on clusters and the American College of Cardiology/American Heart Association pooled cohort equations. Unsupervised analysis identified 4 phenotypically and prognostically distinct clusters. All-cause mortality was highest in cluster 1 (oldest/most comorbid; 26%), whereas major adverse cardiovascular and cerebrovascular event rates were highest in cluster 2 (youngest/multiethnic; 41%). Cluster 4 (middle-aged/ healthiest behaviors) experienced more incident major adverse cardiovascular and cerebrovascular events (30%) than cluster 3 (middle-aged/lowest medication adherence; 23%), despite apparently similar risk factor and lifestyle profiles. In comparison with the pooled cohort equations, cluster membership was more informative for risk assessment of myocardial infarction, stroke, and mortality. CONCLUSIONS: Unsupervised clustering identified 4 unique coronary artery disease subgroups with distinct clinical trajectories. Flexible unsupervised machine learning algorithms offer the ability to meaningfully process heterogeneous patient data and provide sharper insights into disease characterization and risk assessment.
引用
收藏
页数:33
相关论文
共 45 条
  • [1] Machine Learning Methods Improve Prognostication, Identify Clinically Distinct Phenotypes, and Detect Heterogeneity in Response to Therapy in a Large Cohort of Heart Failure Patients
    Ahmad, Tariq
    Lund, Lars H.
    Rao, Pooja
    Ghosh, Rohit
    Warier, Prashant
    Vaccaro, Benjamin
    Dahlstrom, Ulf
    O'Connor, Christopher M.
    Felker, G. Michael
    Desai, Nihar R.
    [J]. JOURNAL OF THE AMERICAN HEART ASSOCIATION, 2018, 7 (08):
  • [2] Clinical Implications of Chronic Heart Failure Phenotypes Defined by Cluster Analysis
    Ahmad, Tariq
    Pencina, Michael J.
    Schulte, Phillip J.
    O'Brien, Emily
    Whellan, David J.
    Pina, Ileana L.
    Kitzman, Dalane W.
    Lee, Kerry L.
    O'Connor, Christopher M.
    Felker, G. Michael
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2014, 64 (17) : 1765 - 1774
  • [3] [Anonymous], 2014, ARXIV PREPRINT ARXIV
  • [4] BLASHFIELD RK, 1991, J CLASSIF, V8, P277
  • [5] Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
    Burton, Paul R.
    Clayton, David G.
    Cardon, Lon R.
    Craddock, Nick
    Deloukas, Panos
    Duncanson, Audrey
    Kwiatkowski, Dominic P.
    McCarthy, Mark I.
    Ouwehand, Willem H.
    Samani, Nilesh J.
    Todd, John A.
    Donnelly, Peter
    Barrett, Jeffrey C.
    Davison, Dan
    Easton, Doug
    Evans, David
    Leung, Hin-Tak
    Marchini, Jonathan L.
    Morris, Andrew P.
    Spencer, Chris C. A.
    Tobin, Martin D.
    Attwood, Antony P.
    Boorman, James P.
    Cant, Barbara
    Everson, Ursula
    Hussey, Judith M.
    Jolley, Jennifer D.
    Knight, Alexandra S.
    Koch, Kerstin
    Meech, Elizabeth
    Nutland, Sarah
    Prowse, Christopher V.
    Stevens, Helen E.
    Taylor, Niall C.
    Walters, Graham R.
    Walker, Neil M.
    Watkins, Nicholas A.
    Winzer, Thilo
    Jones, Richard W.
    McArdle, Wendy L.
    Ring, Susan M.
    Strachan, David P.
    Pembrey, Marcus
    Breen, Gerome
    St Clair, David
    Caesar, Sian
    Gordon-Smith, Katherine
    Jones, Lisa
    Fraser, Christine
    Green, Elain K.
    [J]. NATURE, 2007, 447 (7145) : 661 - 678
  • [6] The 9p21 Myocardial Infarction Risk Allele Increases Risk of Peripheral Artery Disease in Older People
    Cluett, Christie
    McDermott, Mary McGrae
    Guralnik, Jack
    Ferrucci, Luigi
    Bandinelli, Stefania
    Miljkovic, Iva
    Zmuda, Joseph M.
    Li, Rongling
    Tranah, Greg
    Harris, Tamara
    Rice, Neil
    Henley, William
    Frayling, Timothy M.
    Murray, Anna
    Melzer, David
    [J]. CIRCULATION-CARDIOVASCULAR GENETICS, 2009, 2 (04) : 347 - 353
  • [7] Validation of the Framingham Coronary Heart Disease prediction scores - Results of a multiple ethnic groups investigation
    D'Agostino, RB
    Grundy, S
    Sullivan, LM
    Wilson, P
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2001, 286 (02): : 180 - 187
  • [8] Clustering Algorithms: On Learning, Validation, Performance, and Applications to Genomics
    Dalton, Lori
    Ballarin, Virginia
    Brun, Marcel
    [J]. CURRENT GENOMICS, 2009, 10 (06) : 430 - 445
  • [9] Artificial Intelligence in Cardiovascular Imaging JACC State-of-the-Art Review
    Dey, Damini
    Slomka, Piotr J.
    Leeson, Paul
    Comaniciu, Dorin
    Shrestha, Sirish
    Sengupta, Partho P.
    Marwick, Thomas H.
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 73 (11) : 1317 - 1335
  • [10] Cigarette Smoking, Smoking Cessation, and Long-Term Risk of 3 Major Atherosclerotic Diseases
    Ding, Ning
    Sang, Yingying
    Chen, Jingsha
    Ballew, Shoshana H.
    Kalbaugh, Corey A.
    Salameh, Maya J.
    Blaha, Michael J.
    Allison, Matthew
    Heiss, Gerardo
    Selvin, Elizabeth
    Coresh, Josef
    Matsushita, Kunihiro
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2019, 74 (04) : 495 - 504