Machine learning based efficient prediction of positive cases of waterborne diseases

被引:8
作者
Hussain, Mushtaq [1 ]
Cifci, Mehmet Akif [2 ,3 ]
Sehar, Tayyaba [1 ]
Nabi, Said [1 ]
Cheikhrouhou, Omar [4 ]
Maqsood, Hasaan [5 ]
Ibrahim, Muhammad [6 ]
Mohammad, Fida [5 ]
机构
[1] Virtual Univ Pakistan, Dept Comp Sci & Informat Technol, Lahore, Pakistan
[2] Bandirma Onyedi Eylul Univ, Dept Comp Engn, Balikesir, Turkiye
[3] Klaipeda State Univ Appl Sci, Informat, LT-91274 Klaipeda, Lithuania
[4] Univ Sfax, Natl Sch Engineers Sfax, CES Lab, Sfax 3038, Tunisia
[5] Univ Haripur, Dept Informat Technol, Haripur, Pakistan
[6] Jeju Natl Univ, Dept Comp Engn, Jeju Si, South Korea
关键词
Machine learning; Patient information; Malaria; Typhoid; Waterborne disease; DRINKING-WATER;
D O I
10.1186/s12911-022-02092-1
中图分类号
R-058 [];
学科分类号
摘要
BackgroundWater quality has been compromised and endangered by different contaminants due to Pakistan's rapid population development, which has resulted in a dramatic rise in waterborne infections and afflicted many regions of Pakistan. Because of this, modeling and predicting waterborne diseases has become a hot topic for researchers and is very important for controlling waterborne disease pollution.MethodsIn our study, first, we collected typhoid and malaria patient data for the years 2017-2020 from Ayub Medical Hospital. The collected data set has seven important input features. In the current study, different ML models were first trained and tested on the current study dataset using the tenfold cross-validation method. Second, we investigated the importance of input features in waterborne disease-positive case detection. The experiment results showed that Random Forest correctly predicted malaria-positive cases 60% of the time and typhoid-positive cases 77% of the time, which is better than other machine-learning models. In this research, we have also investigated the input features that are more important in the prediction and will help analyze positive cases of waterborne disease. The random forest feature selection technique has been used, and experimental results have shown that age, history, and test results play an important role in predicting waterborne disease-positive cases. In the end, we concluded that this interesting study could help health departments in different areas reduce the number of people who get sick from the water.
引用
收藏
页数:16
相关论文
共 40 条
[1]  
Ahmed M., 2020, Int. J. Adv. Sci. Technol., V29, P275
[2]   Efficient Water Quality Prediction Using Supervised Machine Learning [J].
Ahmed, Umair ;
Mumtaz, Rafia ;
Anwar, Hirra ;
Shah, Asad A. ;
Irfan, Rabia ;
Garcia-Nieto, Jose .
WATER, 2019, 11 (11)
[3]  
Archana A, 2015, EVALUATING MICROBIAL
[4]  
Ayub medical college Abbottabad, 2021, PAK AYUB TEACH HOSP
[5]   Prediction Is a Balancing Act: Importance of Sampling Methods to Balance Sensitivity and Specificity o Predictive Models Based on Imbalanced Chemical Data Sets [J].
Banerjee, Priyanka ;
Dehnbostel, Frederic O. ;
Preissner, Robert .
FRONTIERS IN CHEMISTRY, 2018, 6
[6]   Predicting risk of early discontinuation of exclusive breastfeeding at a Brazilian referral hospital for high-risk neonates and infants: a decision-tree analysis [J].
Bernardes Silva, Maira Domingues ;
Carvalhaes de Oliveira, Raquel de Vasconcellos ;
Barroso Alves, Davi Da Silveira ;
Prates Melo, Enirtes Caetano .
INTERNATIONAL BREASTFEEDING JOURNAL, 2021, 16 (01)
[7]   Cholera Risk: A Machine Learning Approach Applied to Essential Climate Variables [J].
Campbell, Amy Marie ;
Racault, Marie-Fanny ;
Goult, Stephen ;
Laurenson, Angus .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (24) :1-24
[8]   Integrating field observations and process-based modeling to predict watershed water quality under environmental perturbations [J].
Chen, Xingyuan ;
Lee, Raymond Mark ;
Dwivedi, Dipankar ;
Son, Kyongho ;
Fang, Yilin ;
Zhang, Xuesong ;
Graham, Emily ;
Stegen, James ;
Fisher, Joshua B. ;
Moulton, David ;
Scheibe, Timothy D. .
JOURNAL OF HYDROLOGY, 2021, 602
[9]   A survey of machine learning methods applied to anomaly detection on drinking-water quality data [J].
Dogo, Eustace M. ;
Nwulu, Nnamdi, I ;
Twala, Bhekisipho ;
Aigbavboa, Clinton .
URBAN WATER JOURNAL, 2019, 16 (03) :235-248
[10]  
Dubey Harshit, 2013, Advances in Knowledge Discovery and Data Mining. 17th Pacific-Asia Conference (PAKDD 2013). Proceedings, P305, DOI 10.1007/978-3-642-37456-2_26