A population-based study exploring phenotypic clusters and clinical outcomes in stroke using unsupervised machine learning

被引:4
作者
Akyea, Ralph K. [1 ]
Ntaios, George [2 ]
Kontopantelis, Evangelos [3 ,4 ]
Georgiopoulos, Georgios [5 ]
Soria, Daniele [6 ]
Asselbergs, Folkert W. [7 ,8 ]
Kai, Joe [1 ]
Weng, Stephen F. [1 ]
Qureshi, Nadeem [1 ]
机构
[1] Univ Nottingham, Ctr Acad Primary Care, Sch Med, PRISM Res Grp, Nottingham, England
[2] Univ Thessaly, Fac Med, Sch Hlth Sci, Dept Internal Med, Larisa, Greece
[3] Univ Manchester, Manchester Acad Hlth Sci Ctr MAHSC, Div Populat Hlth Hlth Serv Res & Primary Care, Sch Hlth Sci,Fac Biol Med & Hlth, Manchester, England
[4] Univ Manchester, Div Informat Imaging & Data Sci, Manchester Acad Hlth Sci Ctr MAHSC, Fac Biol Med & Hlth,Sch Hlth Sci, Manchester, England
[5] Kings Coll London, St Thomas Hosp, Sch Biomed Engn & Imaging Sci, London, England
[6] Univ Kent, Sch Comp, Canterbury, England
[7] Univ Amsterdam, Amsterdam Univ Med Ctr, Dept Cardiol, Amsterdam, Netherlands
[8] UCL, Hlth Data Res & Inst Hlth Informat, London, England
来源
PLOS DIGITAL HEALTH | 2023年 / 2卷 / 09期
关键词
ASSOCIATION; SELECTION; RISK;
D O I
10.1371/journal.pdig.0000334
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Individuals developing stroke have varying clinical characteristics, demographic, and biochemical profiles. This heterogeneity in phenotypic characteristics can impact on cardiovascular disease (CVD) morbidity and mortality outcomes. This study uses a novel clustering approach to stratify individuals with incident stroke into phenotypic clusters and evaluates the differential burden of recurrent stroke and other cardiovascular outcomes. We used linked clinical data from primary care, hospitalisations, and death records in the UK. A data- driven clustering analysis (kamila algorithm) was used in 48,114 patients aged > 18 years with incident stroke, from 1-Jan-1998 to 31-Dec-2017 and no prior history of serious vascular events. Cox proportional hazards regression was used to estimate hazard ratios (HRs) for subsequent adverse outcomes, for each of the generated clusters. Adverse outcomes included coronary heart disease (CHD), recurrent stroke, peripheral vascular disease (PVD), heart failure, CVD-related and all-cause mortality. Four distinct phenotypes with varying underlying clinical characteristics were identified in patients with incident stroke. Compared with cluster 1 (n = 5,201, 10.8%), the risk of composite recurrent stroke and CVD-related mortality was higher in the other 3 clusters (cluster 2 [n =18,655, 38.8%]: hazard ratio [HR], 1.07; 95% CI, 1.02-1.12; cluster 3 [n = 10,244, 21.3%]: HR, 1.20; 95% CI, 1.14-1.26; and cluster 4 [n =14,014, 29.1%]: HR, 1.44; 95% CI: 1.37-1.50). Similar trends in risk were observed for composite recurrent stroke and all-cause mortality outcome, and subsequent recurrent stroke outcome. However, results were not consistent for subsequent risk in CHD, PVD, heart failure, CVD-related mortality, and all-cause mortality. In this proof of principle study, we demonstrated how a heterogenous population of patients with incident stroke can be stratified into four relatively homogenous phenotypes with differential risk of recurrent and major cardiovascular outcomes. This offers an opportunity to revisit the stratification of care for patients with incident stroke to improve patient outcomes.
引用
收藏
页数:16
相关论文
共 43 条
[1]   Novel subgroups of adult-onset diabetes and their association with outcomes: a data-driven cluster analysis of six variables [J].
Ahlqvist, Emma ;
Storm, Petter ;
Karajamaki, Annemari ;
Martinell, Mats ;
Dorkhan, Mozhgan ;
Carlsson, Annelie ;
Vikman, Petter ;
Prasad, Rashmi B. ;
Aly, Dina Mansour ;
Almgren, Peter ;
Wessman, Ylva ;
Shaat, Nael ;
Spegel, Peter ;
Mulder, Hindrik ;
Lindholm, Eero ;
Melander, Olle ;
Hansson, Ola ;
Malmqvist, Ulf ;
Lernmark, Ake ;
Lahti, Kaj ;
Forsen, Tom ;
Tuomi, Tiinamaija ;
Rosengren, Anders H. ;
Groop, Leif .
LANCET DIABETES & ENDOCRINOLOGY, 2018, 6 (05) :361-369
[2]   Clinical Implications of Chronic Heart Failure Phenotypes Defined by Cluster Analysis [J].
Ahmad, Tariq ;
Pencina, Michael J. ;
Schulte, Phillip J. ;
O'Brien, Emily ;
Whellan, David J. ;
Pina, Ileana L. ;
Kitzman, Dalane W. ;
Lee, Kerry L. ;
O'Connor, Christopher M. ;
Felker, G. Michael .
JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2014, 64 (17) :1765-1774
[3]   Sex, Age, and Socioeconomic Differences in Nonfatal Stroke Incidence and Subsequent Major Adverse Outcomes [J].
Akyea, Ralph K. ;
Vinogradova, Yana ;
Qureshi, Nadeem ;
Patel, Riyaz S. ;
Kontopantelis, Evangelos ;
Ntaios, George ;
Asselbergs, Folkert W. ;
Kai, Joe ;
Weng, Stephen F. .
STROKE, 2021, 52 (02) :396-405
[4]   The curse(s) of dimensionality [J].
Altman, Naomi ;
Krzywinski, Martin .
NATURE METHODS, 2018, 15 (06) :399-400
[5]  
[Anonymous], 2019, Treatment episode data set (TEDS): 2017. Admissions to and discharges from publicly-funded substance use treatment
[6]  
[Anonymous], 2015, ENGLISH INDICES DEPR
[7]  
[Anonymous], Office for National Statistics (2019). Available at https://www.ons.gov.uk/peoplepopulationandcommunity/culturalidentity [ Accessed: 12 March 2019).
[8]   Lower target blood pressures are safe and effective for the prevention of recurrent stroke: the PROGRESS trial [J].
Arima, Hisatomi ;
Chalmers, John ;
Woodward, Mark ;
Anderson, Craig ;
Rodgers, Anthony ;
Davis, Stephen ;
MacMahon, Stephen ;
Neal, Bruce .
JOURNAL OF HYPERTENSION, 2006, 24 (06) :1201-1208
[9]   Physical Activity and Exercise Recommendations for Stroke Survivors A Statement for Healthcare Professionals From the American Heart Association/American Stroke Association [J].
Billinger, Sandra A. ;
Arena, Ross ;
Bernhardt, Julie ;
Eng, Janice J. ;
Franklin, Barry A. ;
Johnson, Cheryl Mortag ;
MacKay-Lyons, Marilyn ;
Macko, Richard F. ;
Mead, Gillian E. ;
Roth, Elliot J. ;
Shaughnessy, Marianne ;
Tang, Ada .
STROKE, 2014, 45 (08) :2532-2553
[10]  
CPRD @ Cambridge, Codes Lists (GOLD)