A retrospective study using machine learning to develop predictive model to identify rotavirus-associated acute gastroenteritis in children

Authors
Paul, Sourav [1]
Rahman, Minhazur [2]
Dolley, Anutee [3]
Saikia, Kasturi [3]
Singh, Chongtham Shyamsunder [4]
Mohammed, Arifullah [5]
Muteeb, Ghazala [6]
Sarmah, Rosy [7]
Namsa, Nima D. [3]
Affiliations
[1] Natl Inst Technol, Dept Biotechnol, Durgapur, West Bengal, India
[2] Tezpur Univ, Dept Comp Sci & Engn, Napaam, Assam, India
[3] Tezpur Univ, Dept Mol Biol & Biotechnol, Napaam, Assam, India
[4] Reg Inst Med Sci, Dept Paediat, Imphal, Manipur, India
[5] Univ Malaysia Kelantan, Fac Agrobased Ind, Dept Agr Sci, Kelantan, Malaysia
[6] King Faisal Univ, Coll Appl Med Sci, Dept Nursing, Al Hasa, Saudi Arabia
[7] Tezpur Univ, Dept Comp Sci & Engn, Napaam, Assam, India
Source
PEERJ | 2025 / Vol. 13
Keywords
Rotavirus; Gastroenteritis; Machine learning; Disease diagnosis; Supervised learning; Child health; ARTIFICIAL-INTELLIGENCE; DIARRHEA; SURVEILLANCE; IDENTIFICATION; INFECTION; DISEASES; BURDEN; IMPACT
DOI
10.7717/peerj.19025
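Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]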
Discipline classification code
07; 0710; 09
Abstract
Background. Rotavirus is the leading cause of severe dehydrating diarrhea in children under 5 years worldwide. Timely diagnosis is critical, but access to confirmatory testing is limited in hospital settings. Machine learning (ML) models have shown promise in supporting symptom-based diagnosis of several diseases in resource-limited settings.
Objectives. This study aims to develop a machine-learning predictive model that integrates multiple clinical parameters specific to rotavirus infection without relying on laboratory tests.
Methods. A clinical dataset of 509 children was collected in collaboration with the Regional Institute of Medical Sciences, Imphal, India. The clinical symptoms included diarrhea and its duration, number of stool episodes per day, fever, vomiting and its duration, number of vomiting episodes per day, temperature, and dehydration. Correlation analysis was performed to check feature-feature and feature-outcome collinearity. Feature selection using the ANOVA F-test was carried out to obtain feature importance values and, from them, a reduced feature subset. Seven supervised learning models were tested and compared: support vector machine (SVM), K-nearest neighbor (KNN), naive Bayes (NB), logistic regression (Log_R), random forest (RF), decision tree (DT), and XGBoost (XGB). The models were compared using their classification results, and their performance was evaluated based on accuracy, precision, recall, specificity, F1 score, macro F1 score, F2 score, and the receiver operating characteristic (ROC) curve.
Results. The seven ML models were evaluated on our dataset and compared on eight evaluation measures: accuracy, precision, recall, specificity, F1 score, F2 score, macro F1 score, and area under the curve (AUC). RF performed best, with an accuracy of 81.4%, F1 score of 86.9%, macro F1 score of 77.3%, F2 score of 86.5%, and AUC of 89%.
Conclusions. Machine learning models can contribute to symptom-based diagnosis of rotavirus-associated acute gastroenteritis in children, especially in resource-limited settings. Further validation of the models on a larger dataset is needed to achieve optimum sensitivity and specificity in pediatric diarrheal populations.
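The threshold-based evaluation measures named in the abstract can all be derived from a single binary confusion matrix. The following is a minimal sketch of those formulas, not the authors' code: the `Confusion` class, the function names, and the example counts are hypothetical, and it assumes the positive class is "rotavirus-positive" and that macro F1 is the unweighted mean of the per-class F1 scores. AUC is omitted because it requires predicted scores rather than hard labels.

```python
from dataclasses import dataclass


@dataclass
class Confusion:
    """Counts from a binary confusion matrix (hypothetical container)."""
    tp: int  # positive cases predicted positive
    fp: int  # negative cases predicted positive
    fn: int  # positive cases predicted negative
    tn: int  # negative cases predicted negative


def _ratio(num: float, den: float) -> float:
    """Return num / den, or 0.0 when the denominator is zero."""
    return num / den if den else 0.0


def f_beta(precision: float, recall: float, beta: float) -> float:
    """F-beta score; beta > 1 weights recall more heavily than precision."""
    b2 = beta * beta
    return _ratio((1 + b2) * precision * recall, b2 * precision + recall)


def binary_metrics(c: Confusion) -> dict:
    """Compute the threshold-based measures listed in the abstract."""
    precision = _ratio(c.tp, c.tp + c.fp)
    recall = _ratio(c.tp, c.tp + c.fn)        # also called sensitivity
    specificity = _ratio(c.tn, c.tn + c.fp)
    accuracy = _ratio(c.tp + c.tn, c.tp + c.fp + c.fn + c.tn)

    f1 = f_beta(precision, recall, 1.0)
    f2 = f_beta(precision, recall, 2.0)

    # F1 of the negative class: swap the roles of the two classes.
    neg_precision = _ratio(c.tn, c.tn + c.fn)
    neg_recall = specificity
    f1_negative = f_beta(neg_precision, neg_recall, 1.0)

    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "specificity": specificity,
        "f1": f1,
        "f2": f2,
        "macro_f1": (f1 + f1_negative) / 2,
    }


# Made-up counts for illustration only; they are NOT the study's results.
example = Confusion(tp=60, fp=10, fn=15, tn=15)
scores = binary_metrics(example)
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

When the two classes are imbalanced, the positive-class F1 can sit well above the macro F1, because macro F1 also averages in the F1 of the smaller negative class. This is consistent with the gap between the F1 and macro F1 values reported above, though the abstract does not state the class distribution.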
Pages: 21