Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

Cited by: 6
Authors
Ong, Song Quan [1]
Isawasan, Pradeep [2]
Ngesom, Ahmad Mohiddin Mohd [3]
Shahar, Hanipah [4]
Lasim, As'malia Md [5]
Nair, Gomesh [6]
Affiliations
[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia
[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia
[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia
[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia
[6] Univ Sains Malaysia, Sch Elect & Elect Engn, Perai, Penang, Malaysia
DOI
10.1038/s41598-023-46342-2
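Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09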
Abstract
Machine learning (ML) algorithms are receiving considerable attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and higher-performing algorithms. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including ensemble ML methods, and compared their performance using the area under the receiver operating characteristic (ROC) curve (AUC), accuracy and F1 score. Our results show that ensemble ML methods such as XGBoost, AdaBoost and Random Forest perform better than logistic regression, Naive Bayes, decision tree and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of variable importance showed that the container index was the least important. Removing this variable improved the ML models' performance by at least 6% in AUC and F1 score. Our results provide a framework for future studies on the use of predictive models in the development of an early warning system.
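The three comparison metrics named in the abstract (AUC, accuracy and F1 score) can be illustrated with a minimal pure-Python sketch. The labels, the scores, the 0.5 decision threshold and the names `model_a` and `model_b` are hypothetical toy values chosen for illustration; they are not the study's data, models or code.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def roc_auc(y_true, scores):
    """AUC as the probability that a random positive outscores a random
    negative, counting ties as one half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def evaluate(y_true, scores, threshold=0.5):
    """Compute all three metrics; the threshold is an assumed cut-off."""
    y_pred = [1 if s >= threshold else 0 for s in scores]
    return {
        "auc": roc_auc(y_true, scores),
        "accuracy": accuracy(y_true, y_pred),
        "f1": f1_score(y_true, y_pred),
    }


# Hypothetical toy labels (1 = positive class) and model scores.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
model_scores = {
    "model_a": [0.9, 0.2, 0.8, 0.6, 0.4, 0.1, 0.7, 0.3],
    "model_b": [0.6, 0.7, 0.4, 0.8, 0.3, 0.2, 0.9, 0.6],
}

results = {name: evaluate(y_true, s) for name, s in model_scores.items()}
for name, metrics in results.items():
    print(name, metrics)
```

Ranking candidate models on all three numbers, rather than on accuracy alone, is one way to make such a comparison robust to the choice of decision threshold, since AUC does not depend on it.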
Pages: 11