A multilayer multimodal detection and prediction model based on explainable artificial intelligence for Alzheimer's disease

被引:166
作者
El-Sappagh, Shaker [1 ,2 ]
Alonso, Jose M. [3 ]
Islam, S. M. Riazul [4 ]
Sultan, Ahmad M. [5 ]
Kwak, Kyung Sup [6 ]
机构
[1] Univ Santiago de Compostela, Ctr Singular Invest Tecnol Intelixentes CiTIUS, Santiago De Compostela 15782, Spain
[2] Benha Univ, Fac Comp & Artificial Intelligence, Informat Syst Dept, Banha 13518, Egypt
[3] Univ Santiago de Compostela, Ctr Singular Invest Tecnol Intelixentes, Santiago 15703, Spain
[4] Sejong Univ, Dept Comp Sci & Engn, 209 Neungdong Ro, Seoul 05006, South Korea
[5] Mansoura Univ, Fac Med, Gastrointestinal Surg Ctr, Mansoura 35516, Egypt
[6] Inha Univ, Dept Informat & Commun Engn, Incheon 22212, South Korea
基金
新加坡国家研究基金会; 加拿大健康研究院; 美国国家卫生研究院;
关键词
MILD COGNITIVE IMPAIRMENT; EARLY-DIAGNOSIS; NEURAL-NETWORKS; PROGRESSION; CLASSIFICATION; CLASSIFIERS; CONVERSION; MCI;
D O I
10.1038/s41598-021-82098-3
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Alzheimer's disease (AD) is the most common type of dementia. Its diagnosis and progression detection have been intensively studied. Nevertheless, research studies often have little effect on clinical practice mainly due to the following reasons: (1) Most studies depend mainly on a single modality, especially neuroimaging; (2) diagnosis and progression detection are usually studied separately as two independent problems; and (3) current studies concentrate mainly on optimizing the performance of complex machine learning models, while disregarding their explainability. As a result, physicians struggle to interpret these models, and feel it is hard to trust them. In this paper, we carefully develop an accurate and interpretable AD diagnosis and progression detection model. This model provides physicians with accurate decisions along with a set of explanations for every decision. Specifically, the model integrates 11 modalities of 1048 subjects from the Alzheimer's Disease Neuroimaging Initiative (ADNI) real-world dataset: 294 cognitively normal, 254 stable mild cognitive impairment (MCI), 232 progressive MCI, and 268 AD. It is actually a two-layer model with random forest (RF) as classifier algorithm. In the first layer, the model carries out a multi-class classification for the early diagnosis of AD patients. In the second layer, the model applies binary classification to detect possible MCI-to-AD progression within three years from a baseline diagnosis. The performance of the model is optimized with key markers selected from a large set of biological and clinical measures. Regarding explainability, we provide, for each layer, global and instance-based explanations of the RF classifier by using the SHapley Additive exPlanations (SHAP) feature attribution framework. In addition, we implement 22 explainers based on decision trees and fuzzy rule-based systems to provide complementary justifications for every RF decision in each layer. Furthermore, these explanations are represented in natural language form to help physicians understand the predictions. The designed model achieves a cross-validation accuracy of 93.95% and an F1-score of 93.94% in the first layer, while it achieves a cross-validation accuracy of 87.08% and an F1-Score of 87.09% in the second layer. The resulting system is not only accurate, but also trustworthy, accountable, and medically applicable, thanks to the provided explanations which are broadly consistent with each other and with the AD medical literature. The proposed system can help to enhance the clinical understanding of AD diagnosis and progression processes by providing detailed insights into the effect of different modalities on the disease risk.
引用
收藏
页数:26
相关论文
共 93 条
  • [1] Peeking Inside the Black-Box: A Survey on Explainable Artificial Intelligence (XAI)
    Adadi, Amina
    Berrada, Mohammed
    [J]. IEEE ACCESS, 2018, 6 : 52138 - 52160
  • [2] On the early diagnosis of Alzheimer's Disease from multimodal signals: A survey
    Alberdi, Ane
    Aztiria, Asier
    Basarab, Adrian
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 71 : 1 - 29
  • [3] Alonso J.M., 2013, Data Mining: Concepts, Methodologies, Tools, and Applications: IGI Global, P1064
  • [4] Alonso J.M., 2019, 2019 IEEE INT C FUZZ, P1
  • [5] A Bibliometric Analysis of the Explainable Artificial Intelligence Research Field
    Alonso, Jose M.
    Castiello, Ciro
    Mencar, Corrado
    [J]. INFORMATION PROCESSING AND MANAGEMENT OF UNCERTAINTY IN KNOWLEDGE-BASED SYSTEMS: THEORY AND FOUNDATIONS, IPMU 2018, PT I, 2018, 853 : 3 - 15
  • [6] Alonso JM, 2015, SPRINGER HANDBOOK OF COMPUTATIONAL INTELLIGENCE, P219
  • [7] [Anonymous], 2015, FUZZY LOGIC INTRO CO
  • [8] [Anonymous], 2017, Journal of Royal Society Interface, DOI [DOI 10.1101/142760, 10.1101/142760]
  • [9] Biomarkers for Alzheimer's disease: current status and prospects for the future
    Blennow, K.
    Zetterberg, H.
    [J]. JOURNAL OF INTERNAL MEDICINE, 2018, 284 (06) : 643 - 663
  • [10] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32