An explainable machine learning based prediction model for Alzheimer's disease in China longitudinal aging study

被引:2
作者
Yue, Ling [1 ]
Chen, Wu-gang [2 ,3 ]
Liu, Sai-chao [2 ,3 ]
Chen, Sheng-bo [2 ,3 ]
Xiao, Shi-fu [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Med, Shanghai Mental Hlth Ctr, Dept Geriatr Psychiat, Shanghai, Peoples R China
[2] Henan Univ, Sch Comp & Informat Engn, Kaifeng, Peoples R China
[3] Henan Univ, Henan Engn Res Ctr Intelligent Technol & Applicat, Kaifeng, Peoples R China
来源
FRONTIERS IN AGING NEUROSCIENCE | 2023年 / 15卷
基金
中国国家自然科学基金;
关键词
Alzheimer's disease; mild cognitive impairment; ensemble learning; feature selection; explainable AI;
D O I
10.3389/fnagi.2023.1267020
中图分类号
R592 [老年病学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 100203 ;
摘要
Alzheimer's disease (AD) is the most common cause of dementia. Accurate prediction and diagnosis of AD and its prodromal stage, i.e., mild cognitive impairment (MCI), is essential for the possible delay and early treatment for the disease. In this paper, we adopt the data from the China Longitudinal Aging Study (CLAS), which was launched in 2011, and includes a joint effort of 15 institutions all over the country. Four thousand four hundred and eleven people who are at least 60 years old participated in the project, where 3,514 people completed the baseline survey. The survey collected data including demographic information, daily lifestyle, medical history, and routine physical examination. In particular, we employ ensemble learning and feature selection methods to develop an explainable prediction model for AD and MCI. Five feature selection methods and nine machine learning classifiers are applied for comparison to find the most dominant features on AD/MCI prediction. The resulting model achieves accuracy of 89.2%, sensitivity of 87.7%, and specificity of 90.7% for MCI prediction, and accuracy of 99.2%, sensitivity of 99.7%, and specificity of 98.7% for AD prediction. We further utilize the SHapley Additive exPlanations (SHAP) algorithm to visualize the specific contribution of each feature to AD/MCI prediction at both global and individual levels. Consequently, our model not only provides the prediction outcome, but also helps to understand the relationship between lifestyle/physical disease history and cognitive function, and enables clinicians to make appropriate recommendations for the elderly. Therefore, our approach provides a new perspective for the design of a computer-aided diagnosis system for AD and MCI, and has potential high clinical application value.
引用
收藏
页数:15
相关论文
共 55 条
  • [1] Feature Selection Using Information Gain for Improved Structural-Based Alert Correlation
    Alhaj, Taqwa Ahmed
    Siraj, Maheyzah Md
    Zainal, Anazida
    Elshoush, Huwaida Tagelsir
    Elhaj, Fatin
    [J]. PLOS ONE, 2016, 11 (11):
  • [2] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [3] [Anonymous], 2000, Quick Reference to the Diagnostic Criteria From DSM-IV-TR
  • [4] Smoking as a risk factor for dementia and cognitive decline: a meta-analysis of prospective studies
    Anstey, Kaarin J.
    von Sanden, Chwee
    Salim, Agus
    O'Kearney, Richard
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2007, 166 (04) : 367 - 378
  • [5] Dietary patterns and risk of dementia - The three-city cohort study
    Barberger-Gateau, P.
    Raffaitin, C.
    Letenneur, L.
    Berr, C.
    Tzourio, C.
    Dartigues, J. F.
    Alperovitch, A.
    [J]. NEUROLOGY, 2007, 69 (20) : 1921 - 1930
  • [6] Bergstra J., 2013, Proceedings of the 12th Python in Science Conference (SciPy 2013)
  • [7] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [8] Associations between self-reported sleep characteristics and incident mild cognitive impairment: The Heinz Nixdorf Recall Cohort Study
    Brachem, Christian
    Winkler, Angela
    Tebruegge, Sarah
    Weimar, Christian
    Erbel, Raimund
    Joeckel, Karl-Heinz
    Stang, Andreas
    Dragano, Nico
    Moebus, Susanne
    Kowall, Bernd
    Jokisch, Martha
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Bagging predictors
    Breiman, L
    [J]. MACHINE LEARNING, 1996, 24 (02) : 123 - 140