Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

被引：6

作者：

Ong, Song Quan ^{[1
]}

Isawasan, Pradeep ^{[2
]}

Ngesom, Ahmad Mohiddin Mohd ^{[3
]}

Shahar, Hanipah ^{[4
]}

Lasim, As'malia Md ^{[5
]}

Nair, Gomesh ^{[6
]}

机构：

[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia

[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia

[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia

[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia

[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia

[6] Univ Sains Malaysia, Sch Elect & Elect Engn, PeraiPenang, Malaysia

来源：

SCIENTIFIC REPORTS | 2023年 / 13卷 / 01期

关键词：

D O I：

10.1038/s41598-023-46342-2

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naive Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.

引用

页数：11

共 50 条

[31] Comparing the performance of 10 machine learning models in predicting Chlorophyll a in western Lake Erie
Song, Yang
Shen, Chunqi
Hong, Yi
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2025, 380
[32] Predicting the Shape of Corneas from Clinical Data with Machine Learning Models
Bouazizi, Hala
Brunette, Isabelle
Meunier, Jean
IRBM, 2024, 45 (05)
[33] Developing and comparing machine learning approaches for predicting insurance penetration rates based on each country
Ghorashi, Seyed Farshid
Bahri, Maziyar
Goodarzi, Atousa
LETTERS IN SPATIAL AND RESOURCE SCIENCES, 2024, 17 (01)
[34] Proposing Machine Learning Models Suitable for Predicting Open Data Utilization
Jeong, Junyoung
Cho, Keuntae
SUSTAINABILITY, 2024, 16 (14)
[35] Machine learning models for predicting hepatic steatosis based on in vivo data
Zdrazil, Barbara
Jain, Sankalp
Klinting, Signe
Escher, Sylvia
Ecker, Gerhard
Norinder, Ulf
ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2018, 256
[36] Comparing Different Resampling Methods in Predicting Students Performance Using Machine Learning Techniques
Ghorbani, Ramin
Ghousi, Rouzbeh
IEEE ACCESS, 2020, 8 : 67899 - 67911
[37] Analysis of artificial intelligence models for predicting vicat temperature of various compounds based on biohdpe comparing different machine learning techniques
Llorca-Alcon, Manuel
Garcia-Sanoguera, David
Linares-Pellicer, Jordi
Molina-Pico, Antonio
DYNA, 2025, 100 (01): : 90 - 96
[38] Coupling meteorological stations data and satellite data for prediction of global solar radiation with machine learning models
Zhao, Shuting
Wu, Lifeng
Xiang, Youzhen
Dong, Jianhua
Li, Zhen
Liu, Xiaoqiang
Tang, Zijun
Wang, Han
Wang, Xin
An, Jiaqi
Zhang, Fucang
Li, Zhijun
RENEWABLE ENERGY, 2022, 198 : 1049 - 1064
[39] Comparing two machine learning approaches in predicting lupus hospitalization using longitudinal data
Zhao, Yijun
Smith, Dylan
Jorge, April
SCIENTIFIC REPORTS, 2022, 12 (01)
[40] Comparing two machine learning approaches in predicting lupus hospitalization using longitudinal data
Yijun Zhao
Dylan Smith
April Jorge
Scientific Reports, 12

← 1 2 3 4 5 →