Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

被引：6

作者：

Ong, Song Quan ^{[1
]}

Isawasan, Pradeep ^{[2
]}

Ngesom, Ahmad Mohiddin Mohd ^{[3
]}

Shahar, Hanipah ^{[4
]}

Lasim, As'malia Md ^{[5
]}

Nair, Gomesh ^{[6
]}

机构：

[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia

[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia

[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia

[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia

[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia

[6] Univ Sains Malaysia, Sch Elect & Elect Engn, PeraiPenang, Malaysia

来源：

SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期

关键词：

D O I：

10.1038/s41598-023-46342-2

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naive Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.

引用

页数：11

共 50 条

[21] The effect of soil physical properties on predicting shear strength parameters based on comparing ensemble learning, deep learning, and support vector machine models
Nguyen, Ba - Quang - Vinh
Kim, Yun - Tae
GEOMECHANICS AND ENGINEERING, 2024, 39 (02) : 241 - 256
[22] Comparing different machine learning techniques for predicting COVID-19 severity
Xiong, Yibai
Ma, Yan
Ruan, Lianguo
Li, Dan
Lu, Cheng
Huang, Luqi
INFECTIOUS DISEASES OF POVERTY, 2022, 11 (01)
[23] Comparing different machine learning techniques for predicting COVID-19 severity
Yibai Xiong
Yan Ma
Lianguo Ruan
Dan Li
Cheng Lu
Luqi Huang
Infectious Diseases of Poverty, 11
[24] Predicting the infecting dengue serotype from antibody titre data using machine learning
Daniels, Bethan Cracknell
Buddhari, Darunee
Hunsawong, Taweewun
Iamsirithaworn, Sopon
Farmer, Aaron R.
Cummings, Derek A. T.
Anderson, Kathryn B.
Dorigatti, Ilaria
PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (12)
[25] Exploring the efficacy of machine learning models for predicting soil radon exhalation rates
Khaled F. Al-Shboul
Ghassan Almasabha
Ali Shehadeh
Odey Alshboul
Stochastic Environmental Research and Risk Assessment, 2023, 37 : 4307 - 4321
[26] Exploring the efficacy of machine learning models for predicting soil radon exhalation rates
Al-Shboul, Khaled F. F.
Almasabha, Ghassan
Shehadeh, Ali
Alshboul, Odey
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2023, 37 (11) : 4307 - 4321
[27] Predicting Employability of Candidates: Comparative Study of Different Machine Learning Models
Hitharth, K. B. Sai
Dhanya, N. M.
PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 179 - 190
[28] Learning Rates of Support Vector Machine Classifiers with Data Dependent Hypothesis Spaces
Sheng, Bao-Huai
Ye, Pei-Xin
JOURNAL OF COMPUTERS, 2012, 7 (01) : 252 - 257
[29] Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries
Otoom, Mohammad Mahmood
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (01): : 220 - 225
[30] Comparing Machine Learning Algorithms And Regression Models For Predicting Functional Outcome In The Stratis Registry
Jumaa, Mouhammad A.
Zoghi, Zeinab
Zaidi, Syed F.
Mueller-Kronast, Nils
Zaidat, Osama
Castonguay, Alicia C.
STROKE, 2022, 53

← 1 2 3 4 5 →