Detection of Monkeypox Cases Based on Symptoms Using XGBoost and Shapley Additive Explanations Methods

被引:20
作者
Farzipour, Alireza [1 ]
Elmi, Roya [2 ]
Nasiri, Hamid [3 ]
机构
[1] Semnan Univ, Dept Comp Sci, Semnan 3513119111, Iran
[2] Semnan Univ, Farzanegan Campus, Semnan 3519734851, Iran
[3] Amirkabir Univ Technol, Dept Comp Engn, Tehran Polytech, Tehran 1591634311, Iran
关键词
monkeypox; XGBoost; SHAP; MPXV; machine learning;
D O I
10.3390/diagnostics13142391
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
The monkeypox virus poses a novel public health risk that might quickly escalate into a worldwide epidemic. Machine learning (ML) has recently shown much promise in diagnosing diseases like cancer, finding tumor cells, and finding COVID-19 patients. In this study, we have created a dataset based on the data both collected and published by Global Health and used by the World Health Organization (WHO). Being entirely textual, this dataset shows the relationship between the symptoms and the monkeypox disease. The data have been analyzed, using gradient boosting methods such as Extreme Gradient Boosting (XGBoost), CatBoost, and LightGBM along with other standard machine learning methods such as Support Vector Machine (SVM) and Random Forest. All these methods have been compared. The research aims to provide an ML model based on symptoms for the diagnosis of monkeypox. Previous studies have only examined disease diagnosis using images. The best performance has belonged to XGBoost, with an accuracy of 1.0 in reviews. To check the model's flexibility, k-fold cross-validation is used, reaching an average accuracy of 0.9 in 5 different splits of the test set. In addition, Shapley Additive Explanations (SHAP) helps in examining and explaining the output of the XGBoost model.
引用
收藏
页数:16
相关论文
共 50 条
[41]   An explainable predictive model for suicide attempt risk using an ensemble learning and Shapley Additive Explanations (SHAP) approach [J].
Nordin, Noratikah ;
Zainol, Zurinahni ;
Noor, Mohd Halim Mohd ;
Chan, Lai Fong .
ASIAN JOURNAL OF PSYCHIATRY, 2023, 79
[42]   Online Education vs Traditional Education: Analysis of Student Performance in Computer Science using Shapley Additive Explanations [J].
Charytanowicz, Malgorzata .
INFORMATICS IN EDUCATION, 2023, 22 (03) :351-368
[43]   A model for predicting academic performance on standardised tests for lagging regions based on machine learning and Shapley additive explanations [J].
Suaza-Medina, Mario ;
Penabaena-Niebles, Rita ;
Jubiz-Diaz, Maria .
SCIENTIFIC REPORTS, 2024, 14 (01)
[44]   Predicting egg production rate and egg weight of broiler breeders based on machine learning and Shapley additive explanations [J].
Ji, Hengyi ;
Xu, Yidan ;
Teng, Guanghui .
POULTRY SCIENCE, 2025, 104 (01)
[45]   Enhancing co-pyrolysis process of biomass and coal using machine learning insights and Shapley additive explanations based on cooperative game theory [J].
Le, Quang Dung ;
Paramasivam, Prabhu ;
Chohan, Jasgurpreet Singh ;
Sirohi, Ranjana ;
Bui, Van Hung ;
Kowalski, Jerzy ;
Le, Huu Cuong ;
Tran, Viet Dung .
ENERGY & ENVIRONMENT, 2025,
[46]   Machine learning prediction of metabolic dysfunction-associated fatty liver disease risk in American adults using body composition: explainable analysis based on SHapley Additive exPlanations [J].
Hong, Yan ;
Chen, Xinrong ;
Wang, Ling ;
Zhang, Fan ;
Zeng, Ziying ;
Xie, Weining .
FRONTIERS IN NUTRITION, 2025, 12
[47]   Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach [J].
Mangalathu, Sujith ;
Hwang, Seong-Hoon ;
Jeon, Jong-Su .
ENGINEERING STRUCTURES, 2020, 219
[48]   Spatial Mapping and Prediction of Groundwater Quality Using Ensemble Learning Models and SHapley Additive exPlanations with Spatial Uncertainty Analysis [J].
Yang, Shilong ;
Luo, Danyuan ;
Tan, Jiayao ;
Li, Shuyi ;
Song, Xiaoqing ;
Xiong, Ruihan ;
Wang, Jinghan ;
Ma, Chuanming ;
Xiong, Hanxiang .
WATER, 2024, 16 (17)
[49]   Prediction of gully erosion susceptibility through the lens of the SHapley Additive exPlanations (SHAP) method using a stacking ensemble model [J].
Han, Jeongho ;
Guzman, Jorge A. ;
Chu, Maria L. .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 383
[50]   Understanding Arteriosclerotic Heart Disease Patients Using Electronic Health Records: A Machine Learning and Shapley Additive exPlanations Approach [J].
Miranda, Eka ;
Adiarto, Suko ;
Bhatti, Faqir M. ;
Zakiyyah, Alfi Yusrotis ;
Aryuni, Mediana ;
Bernando, Charles .
HEALTHCARE INFORMATICS RESEARCH, 2023, 29 (03) :228-238