Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

被引:6
|
作者
Ong, Song Quan [1 ]
Isawasan, Pradeep [2 ]
Ngesom, Ahmad Mohiddin Mohd [3 ]
Shahar, Hanipah [4 ]
Lasim, As'malia Md [5 ]
Nair, Gomesh [6 ]
机构
[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia
[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia
[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia
[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia
[6] Univ Sains Malaysia, Sch Elect & Elect Engn, PeraiPenang, Malaysia
关键词
D O I
10.1038/s41598-023-46342-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naive Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Comparative analysis of advanced deep learning models for predicting evapotranspiration based on meteorological data in bangladesh
    Paul, Sourov
    Farzana, Syeda Zehan
    Das, Saikat
    Das, Pobithra
    Kashem, Abul
    Environmental Science and Pollution Research, 2024, 31 (50) : 60041 - 60064
  • [42] Evaluation of different machine learning models for predicting and mapping the susceptibility of gully erosion
    Rahmati, Omid
    Tahmasebipour, Nasser
    Haghizadeh, Ali
    Pourghasemi, Hamid Reza
    Feizizadeh, Bakhtiar
    GEOMORPHOLOGY, 2017, 298 : 118 - 137
  • [43] Predicting congenital syphilis cases: A performance evaluation of different machine learning models
    Teixeira, Igor Vitor
    Leite, Morgana Thalita da Silva
    Melo, Flavio Leandro de Morais
    Rocha, Elisson da Silva
    Sadok, Sara
    Carrarine, Ana Sofia Pessoa da Costa
    Santana, Marilia
    Rodrigues, Cristina Pinheiro
    Oliveira, Ana Maria de Lima
    Gadelha, Keduly Vieira
    de Morais, Cleber Matos
    Kelner, Judith
    Endo, Patricia Takako
    PLOS ONE, 2023, 18 (06):
  • [44] Machine learning models for predicting rock fracture toughness at different temperature conditions
    Hu, Xunjian
    Liao, Dong
    Ma, Dongdong
    Xie, Senlin
    Xie, Ni
    Hu, Haibo
    Gong, Xiaonan
    CASE STUDIES IN CONSTRUCTION MATERIALS, 2023, 19
  • [45] Estimation of Reference Crop Evapotranspiration with Three Different Machine Learning Models and Limited Meteorological Variables
    Yong, Stephen Luo Sheng
    Ng, Jing Lin
    Huang, Yuk Feng
    Ang, Chun Kit
    AGRONOMY-BASEL, 2023, 13 (04):
  • [46] Comparing SMOTE Family Techniques in Predicting Insurance Premium Defaulting using Machine Learning Models
    Kotb, Mohamed Hanafy
    Ming, Ruixing
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (09) : 621 - 629
  • [47] Comparing Traditional Regression and Machine Learning Models in Predicting Acute Respiratory Distress Syndrome Mortality
    Fernandez, Marcos Valiente
    de Aledo, Amanda Lesmes Gonzalez
    Badia, Isaias Martin
    Moya, Francisco de Paula Delgado
    CRITICAL CARE MEDICINE, 2024, 52 (02) : e105 - e106
  • [48] Machine learning model for predicting out-of-hospital cardiac arrests using meteorological and chronological data
    Nakashima, Takahiro
    Ogata, Soshiro
    Noguchi, Teruo
    Tahara, Yoshio
    Onozuka, Daisuke
    Kato, Satoshi
    Yamagata, Yoshiki
    Kojima, Sunao
    Iwami, Taku
    Sakamoto, Tetsuya
    Nagao, Ken
    Nonogi, Hiroshi
    Yasuda, Satoshi
    Iihara, Koji
    Neumar, Robert
    Nishimura, Kunihiro
    HEART, 2021, 107 (13) : 1084 - 1091
  • [49] The development of machine learning models for predicting the obesity risk from MARKAGE data
    Valeanu, A.
    Gradinaru, D.
    FEBS OPEN BIO, 2024, 14 : 342 - 342
  • [50] A SYSTEMATIC REVIEW OF MACHINE LEARNING MODELS FOR PREDICTING OUTCOMES OF STROKE WITH STRUCTURED DATA
    Wang, W.
    Kiik, M.
    Peek, N.
    Curcin, V.
    Marshall, I.
    Rudd, A.
    Wang, Y.
    Douiri, A.
    Wolfe, C.
    Bray, B.
    INTERNATIONAL JOURNAL OF STROKE, 2020, 15 (1_SUPPL) : 641 - 641