Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

被引:6
|
作者
Ong, Song Quan [1 ]
Isawasan, Pradeep [2 ]
Ngesom, Ahmad Mohiddin Mohd [3 ]
Shahar, Hanipah [4 ]
Lasim, As'malia Md [5 ]
Nair, Gomesh [6 ]
机构
[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia
[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia
[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia
[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia
[6] Univ Sains Malaysia, Sch Elect & Elect Engn, PeraiPenang, Malaysia
关键词
D O I
10.1038/s41598-023-46342-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Machine learning algorithms (ML) are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and algorithms that have higher performance. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including an ensemble ML method, and compared their performance using the receiver operating characteristic (ROC) with the area under the curve (AUC), accuracy and F1 score. Our results show that an ensemble ML such as XG Boost, AdaBoost and Random Forest perform better than the logistics regression, Naive Bayens, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our result provides a framework for future studies on the use of predictive models in the development of an early warning system.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data
    Song Quan Ong
    Pradeep Isawasan
    Ahmad Mohiddin Mohd Ngesom
    Hanipah Shahar
    As’malia Md Lasim
    Gomesh Nair
    Scientific Reports, 13
  • [2] Comparing vector-host and SIR models for dengue transmission
    Pandey, Abhishek
    Mubayi, Anuj
    Medlock, Jan
    MATHEMATICAL BIOSCIENCES, 2013, 246 (02) : 252 - 259
  • [3] A Comparative Study of Machine Learning Models for Predicting Meteorological Data in Agricultural Applications
    Suljug, Jelena
    Spisic, Josip
    Grgic, Kresimir
    Zagar, Drago
    ELECTRONICS, 2024, 13 (16)
  • [4] Predicting Dengue Fever Transmission Using Machine Learning Methods
    Siddiq, A.
    Shukla, N.
    Pradhan, B.
    2021 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT (IEEE IEEM21), 2021, : 21 - 26
  • [5] PREDICTION OF TRANSMISSION RATES OF DENGUE IN NATIONAL CAPITAL TERRITORY DELHI USING MACHINE LEARNING MODELS
    Sharma, Vipasha
    Ghosh, Sanjay Kumar
    Khare, Siddhartha
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1117 - 1122
  • [6] Machine learning models for predicting the performance of solar-geothermal desalination in different meteorological conditions
    Farahani, Somayeh Davoodabadi
    Farahani, Amir Davoodabadi
    AIN SHAMS ENGINEERING JOURNAL, 2024, 15 (03)
  • [7] Estimation of soil temperature from meteorological data using different machine learning models
    Feng, Yu
    Cui, Ningbo
    Hao, Weiping
    Gao, Lili
    Gong, Daozhi
    GEODERMA, 2019, 338 : 67 - 77
  • [8] Predicting Runway Configurations and Arrival and Departure Rates at Airports: Comparing the Accuracy of Multiple Machine Learning Models
    Raju, Ramakrishna
    Mital, Rohit
    Wilson, Bruce
    Shetty, Kamala
    Albert, Michael
    2021 IEEE/AIAA 40TH DIGITAL AVIONICS SYSTEMS CONFERENCE (DASC), 2021,
  • [9] Support Vector Machine Outperforms Other Machine Learning Models in Early Diagnosis of Dengue Using Routine Clinical Data
    Qaiser, Ariba
    Manzoor, Sobia
    Hashmi, Asraf Hussain
    Javed, Hasnain
    Zafar, Anam
    Ashraf, Javed
    ADVANCES IN VIROLOGY, 2024, 2024
  • [10] Predicting leishmaniasis outbreaks in Brazil using machine learning models based on disease surveillance and meteorological data
    Donizette, Andre Cintas
    Rocco, Cleber Damiao
    de Queiroz, Thiago Alves
    OPERATIONS RESEARCH FOR HEALTH CARE, 2025, 44