Unsupervised Machine Learning to Identify High Likelihood of Dementia in Population-Based Surveys: Development and Validation Study

被引:46
作者
de langavant, Laurent Cleret [1 ,2 ,3 ,4 ,5 ]
Bayen, Eleonore [5 ,6 ,7 ,8 ]
Yaffe, Kristine [5 ,9 ]
机构
[1] Hop Henri Mondor, AP HP, Serv Neurol, 51 Ave Marechal Lattre Tassigny,3eme Etage, F-94000 Creteil, France
[2] Univ Paris Est, Fac Med, Dept Neurol, Creteil, France
[3] INSERM, Neuropsychol Intervent U955 EQ1, Inst Mondor Rech Biomed, Creteil, France
[4] PSL Res Univ, Dept Etud Cognit, Ecole Normale Super, Paris, France
[5] Univ Calif San Francisco, Global Brain Hlth Inst, Memory & Aging Ctr, San Francisco, CA 94143 USA
[6] Hop La Pitie Salpetriere, AP HP, Serv Medec Phys & Readaptat, Paris, France
[7] Sorbonne Univ, Fac Med, Med Phys & Readaptat, Paris, France
[8] Univ Paris 09, Lab Econ & Gest Org Sante, Dept Econ Sante, Paris, France
[9] Univ Calif San Francisco, Dept Psychiat Neurol & Epidemiol & Biostat, Ctr Populat Brain Hlth, San Francisco, CA 94143 USA
关键词
dementia; cognition disorders; health surveys; electronic health records; diagnosis; unsupervised machine learning; cluster analysis; data mining; COGNITIVE DECLINE; R-PACKAGE; RISK; UNDERDIAGNOSIS;
D O I
10.2196/10493
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Dementia is increasing in prevalence worldwide, yet frequently remains undiagnosed, especially in low-and middle-income countries. Population-based surveys represent an underinvestigated source to identify individuals at risk of dementia. Objective: The aim is to identify participants with high likelihood of dementia in population-based surveys without the need of the clinical diagnosis of dementia in a subsample. Methods: Unsupervised machine learning classification (hierarchical clustering on principal components) was developed in the Health and Retirement Study (HRS; 2002-2003, N=18,165 individuals) and validated in the Survey of Health, Ageing and Retirement in Europe (SHARE; 2010-2012, N=58,202 individuals). Results: Unsupervised machine learning classification identified three clusters in HRS: cluster 1 (n=12,231) without any functional or motor limitations, cluster 2 (N=4841) with walking/climbing limitations, and cluster 3 (N=1093) with both functional and walking/climbing limitations. Comparison of cluster 3 with previously published predicted probabilities of dementia in HRS showed that it identified high likelihood of dementia (probability of dementia >0.95; area under the curve [AUC]=0.91). Removing either cognitive or both cognitive and behavioral measures did not impede accurate classification (AUC=0.91 and AUC=0.90, respectively). Three clusters with similar profiles were identified in SHARE (cluster 1: n=40,223; cluster 2: n=15,644; cluster 3: n=2335). Survival rate of participants from cluster 3 reached 39.2% (n=665 deceased) in HRS and 62.2% (n=811 deceased) in SHARE after a 3.9-year follow-up. Surviving participants from cluster 3 in both cohorts worsened their functional and mobility performance over the same period. Conclusions: Unsupervised machine learning identifies high likelihood of dementia in population-based surveys, even without cognitive and behavioral measures and without the need of clinical diagnosis of dementia in a subsample of the population. This method could be used to tackle the global challenge of dementia.
引用
收藏
页数:12
相关论文
共 24 条
[1]  
[Anonymous], 2012, Dementia: a public health priority internet
[2]  
[Anonymous], 2018, ECONOMETRIC ANAL
[3]   Neurocognitive Disorders in DSM-5 [J].
Blazer, Dan .
AMERICAN JOURNAL OF PSYCHIATRY, 2013, 170 (06) :585-587
[4]   Unintended Consequences of Machine Learning in Medicine [J].
Cabitza, Federico ;
Rasoini, Raffaele ;
Gensini, Gian Franco .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (06) :517-518
[5]  
Charrad M, 2014, J STAT SOFTW, V61, P1
[6]   Machine Learning and Prediction in Medicine - Beyond the Peak of Inflated Expectations [J].
Chen, Jonathan H. ;
Asch, Steven M. .
NEW ENGLAND JOURNAL OF MEDICINE, 2017, 376 (26) :2507-2509
[7]   Underdiagnosis of dementia in primary care: Variations in the observed prevalence and comparisons to the expected prevalence [J].
Connolly, Amanda ;
Gaehl, Ella ;
Martin, Helen ;
Morris, Julie ;
Purandare, Nitin .
AGING & MENTAL HEALTH, 2011, 15 (08) :978-984
[8]  
Dias Amit, 2009, Indian J Psychiatry, V51 Suppl 1, pS93
[9]  
Hurd MD, 2013, NEW ENGL J MED, V369, P489, DOI [10.1056/NEJMsa1204629, 10.1056/NEJMc1305541]
[10]   The cost of diagnosing dementia in a community setting [J].
Jedenius, Erik ;
Wimo, Anders ;
Stromqvist, Jan ;
Jonsson, Linus ;
Andreasen, Niels .
INTERNATIONAL JOURNAL OF GERIATRIC PSYCHIATRY, 2010, 25 (05) :476-482