An Explainable Artificial Intelligence Model Proposed for the Prediction of Myalgic Encephalomyelitis/Chronic Fatigue Syndrome and the Identification of Distinctive Metabolites

被引:17
作者
Yagin, Fatma Hilal [1 ]
Alkhateeb, Abedalrhman [2 ]
Raza, Ali [3 ]
Samee, Nagwan Abdel [4 ]
Mahmoud, Noha F. [5 ]
Colak, Cemil [1 ]
Yagin, Burak [1 ]
机构
[1] Inonu Univ, Fac Med, Dept Biostat & Med Informat, TR-44280 Malatya, Turkiye
[2] Lakehead Univ, Comp Sci Dept, Thunder Bay, ON P7B 5E1, Canada
[3] Khwaja Fareed Univ Engn & Informat Technol, Inst Comp Sci, Rahim Yar Khan 64200, Pakistan
[4] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Technol, POB 84428, Riyadh 11671, Saudi Arabia
[5] Princess Nourah bint Abdulrahman Univ, Hlth & Rehabil Sci Coll, Rehabil Sci Dept, POB 84428, Riyadh 11671, Saudi Arabia
关键词
explainable artificial intelligence; myalgic encephalomyelitis/chronic fatigue syndrome; metabolomics data; clinical classification; CROSS-VALIDATION; CLASSIFICATION; OPTIMIZATION; ME/CFS;
D O I
10.3390/diagnostics13233495
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background: Myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) is a complex and debilitating illness with a significant global prevalence, affecting over 65 million individuals. It affects various systems, including the immune, neurological, gastrointestinal, and circulatory systems. Studies have shown abnormalities in immune cell types, increased inflammatory cytokines, and brain abnormalities. Further research is needed to identify consistent biomarkers and develop targeted therapies. This study uses explainable artificial intelligence and machine learning techniques to identify discriminative metabolites for ME/CFS. Material and Methods: The model investigates a metabolomics dataset of CFS patients and healthy controls, including 26 healthy controls and 26 ME/CFS patients aged 22-72. The dataset encapsulated 768 metabolites into nine metabolic super-pathways: amino acids, carbohydrates, cofactors, vitamins, energy, lipids, nucleotides, peptides, and xenobiotics. Random forest methods together with other classifiers were applied to the data to classify individuals as ME/CFS patients and healthy individuals. The classification learning algorithms' performance in the validation step was evaluated using a variety of methods, including the traditional hold-out validation method, as well as the more modern cross-validation and bootstrap methods. Explainable artificial intelligence approaches were applied to clinically explain the optimum model's prediction decisions. Results: The metabolomics of C-glycosyltryptophan, oleoylcholine, cortisone, and 3-hydroxydecanoate were determined to be crucial for ME/CFS diagnosis. The random forest model outperformed the other classifiers in ME/CFS prediction using the 1000-iteration bootstrapping method, achieving 98% accuracy, precision, recall, F1 score, 0.01 Brier score, and 99% AUC. According to the obtained results, the bootstrap validation approach demonstrated the highest classification outcomes. Conclusion: The proposed model accurately classifies ME/CFS patients based on the selected biomarker candidate metabolites. It offers a clear interpretation of risk estimation for ME/CFS, aiding physicians in comprehending the significance of key metabolomic features within the model.
引用
收藏
页数:18
相关论文
共 58 条
[1]   Extreme Gradient Boosting-Based Machine Learning Approach for Green Building Cost Prediction [J].
Alshboul, Odey ;
Shehadeh, Ali ;
Almasabha, Ghassan ;
Almuflih, Ali Saeed .
SUSTAINABILITY, 2022, 14 (11)
[2]   The association of fecal microbiota and fecal, blood serum and urine metabolites in myalgic encephalomyelitis/chronic fatigue syndrome [J].
Armstrong, Christopher W. ;
McGregor, Neil R. ;
Lewis, Donald P. ;
Butt, Henry L. ;
Gooley, Paul R. .
METABOLOMICS, 2017, 13 (01)
[3]   Explainable AI in medical imaging: An overview for clinical practitioners-Beyond saliency-based XAI approaches [J].
Borys, Katarzyna ;
Schmitt, Yasmin Alyssa ;
Nauta, Meike ;
Seifert, Christin ;
Kraemer, Nicole ;
Friedrich, Christoph M. ;
Nensa, Felix .
EUROPEAN JOURNAL OF RADIOLOGY, 2023, 162
[4]   Receiver Operating Characteristic (ROC) Area Under the Curve (AUC): A Diagnostic Measure for Evaluating the Accuracy of Predictors of Education Outcomes [J].
Bowers, Alex J. ;
Zhou, Xiaoliang .
JOURNAL OF EDUCATION FOR STUDENTS PLACED AT RISK, 2019, 24 (01) :20-46
[5]   Scientometric review of artificial intelligence for operations & maintenance of wind turbines: The past, present and future [J].
Chatterjee, Joyjit ;
Dethlefs, Nina .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2021, 144
[6]   Estimation of the dynamic modulus of asphalt concretes using random forests algorithm [J].
Daneshvar, Dana ;
Behnood, Ali .
INTERNATIONAL JOURNAL OF PAVEMENT ENGINEERING, 2022, 23 (02) :250-260
[7]   The Role of Kynurenine Pathway and NAD+ Metabolism in Myalgic Encephalomyelitis/Chronic Fatigue Syndrome [J].
Dehhaghi, Mona ;
Panahi, Hamed Kazemi Shariat ;
Kavyani, Bahar ;
Heng, Benjamin ;
Tan, Vanessa ;
Braidy, Nady ;
Guillemin, Gilles J. .
AGING AND DISEASE, 2022, 13 (03) :698-711
[8]   Statistical methods for validation of predictive models [J].
Diniz, Marcio Augusto .
JOURNAL OF NUCLEAR CARDIOLOGY, 2022, 29 (06) :3248-3255
[9]   External validation: a simulation study to compare cross-validation versus holdout or external testing to assess the performance of clinical prediction models using PET data from DLBCL patients [J].
Eertink, Jakoba J. ;
Heymans, Martijn W. ;
Zwezerijnen, Gerben J. C. ;
Zijlstra, Josee M. ;
de Vet, Henrica C. W. ;
Boellaard, Ronald .
EJNMMI RESEARCH, 2022, 12 (01)
[10]   Improvements on cross-validation: The .632+ bootstrap method [J].
Efron, B ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :548-560