Predicting dengue transmission rates by comparing different machine learning models with vector indices and meteorological data

Cited by: 6
Authors
Ong, Song Quan [1]
Isawasan, Pradeep [2]
Ngesom, Ahmad Mohiddin Mohd [3]
Shahar, Hanipah [4]
Lasim, As'malia Md [5]
Nair, Gomesh [6]
Affiliations
[1] Univ Malaysia Sabah, Inst Trop Biol & Conservat, Entomol Lab, Jalan UMS, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Teknol MARA, Fac Comp & Math Sci, Perak Branch, Tapah Campus, Tapah 35400, Malaysia
[3] Minist Hlth, Inst Publ Hlth, Natl Inst Hlth, Ctr Communicable Dis Res, Shah Alam, Malaysia
[4] Fed Terr Kuala Lumpur & Putrajaya Hlth Dept, Entomol & Pest Unit, Jalan Cenderasari, Kuala Lumpur 50590, Malaysia
[5] Natl Hlth Inst, Inst Med Res, Herbal Med Res Ctr, Phytochem Unit, Setia Alam, Malaysia
[6] Univ Sains Malaysia, Sch Elect & Elect Engn, Perai, Penang, Malaysia
DOI
10.1038/s41598-023-46342-2
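Chinese Library Classification codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract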
Machine learning (ML) algorithms are receiving a lot of attention in the development of predictive models for monitoring dengue transmission rates. Previous work has focused only on specific weather variables and algorithms, and there is still a need for a model that uses more variables and higher-performing algorithms. In this study, we use vector indices and meteorological data as predictors to develop the ML models. We trained and validated seven ML algorithms, including ensemble ML methods, and compared their performance using the area under the receiver operating characteristic (ROC) curve (AUC), accuracy and F1 score. Our results show that ensemble ML methods such as XGBoost, AdaBoost and Random Forest perform better than logistic regression, Naive Bayes, decision tree, and support vector machine (SVM), with XGBoost having the highest AUC, accuracy and F1 score. Analysis of the importance of the variables showed that the container index was the least important. By removing this variable, the ML models improved their performance by at least 6% in AUC and F1 score. Our results provide a framework for future studies on the use of predictive models in the development of an early warning system.
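The abstract compares models on three metrics: AUC, accuracy and F1 score. The sketch below shows one way these could be computed for a binary classifier using only the Python standard library. It is an illustration, not the authors' pipeline: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made up for the example, and no data from the study is reproduced.

```python
from typing import Sequence


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Harmonic mean of precision and recall for the positive class (label 1)."""
    tp = sum(t == 1 and p == 1 for t, p in zip(y_true, y_pred))
    fp = sum(t == 0 and p == 1 for t, p in zip(y_true, y_pred))
    fn = sum(t == 1 and p == 0 for t, p in zip(y_true, y_pred))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def roc_auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random
    negative (ties count as half). Equivalent to the area under the ROC curve."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical): 1 = high-transmission period, 0 = low-transmission period.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
scores = [0.9, 0.2, 0.65, 0.4, 0.55, 0.1, 0.8, 0.3]  # model-predicted probabilities
y_pred = [1 if s >= 0.5 else 0 for s in scores]      # assumed 0.5 threshold

print("accuracy:", accuracy(y_true, y_pred))
print("F1 score:", f1_score(y_true, y_pred))
print("AUC:", roc_auc(y_true, scores))
```

Accuracy and F1 depend on the chosen threshold, while AUC is computed from the raw scores and summarizes ranking quality across all thresholds, which is presumably why a comparison would report all three.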
Pages: 11
Related papers
50 in total
  • [21] The effect of soil physical properties on predicting shear strength parameters based on comparing ensemble learning, deep learning, and support vector machine models
    Nguyen, Ba - Quang - Vinh
    Kim, Yun - Tae
    GEOMECHANICS AND ENGINEERING, 2024, 39 (02) : 241 - 256
  • [22] Comparing different machine learning techniques for predicting COVID-19 severity
    Xiong, Yibai
    Ma, Yan
    Ruan, Lianguo
    Li, Dan
    Lu, Cheng
    Huang, Luqi
    INFECTIOUS DISEASES OF POVERTY, 2022, 11 (01)
  • [23] Comparing different machine learning techniques for predicting COVID-19 severity
    Yibai Xiong
    Yan Ma
    Lianguo Ruan
    Dan Li
    Cheng Lu
    Luqi Huang
    Infectious Diseases of Poverty, 11
  • [24] Predicting the infecting dengue serotype from antibody titre data using machine learning
    Daniels, Bethan Cracknell
    Buddhari, Darunee
    Hunsawong, Taweewun
    Iamsirithaworn, Sopon
    Farmer, Aaron R.
    Cummings, Derek A. T.
    Anderson, Kathryn B.
    Dorigatti, Ilaria
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (12)
  • [25] Exploring the efficacy of machine learning models for predicting soil radon exhalation rates
    Khaled F. Al-Shboul
    Ghassan Almasabha
    Ali Shehadeh
    Odey Alshboul
    Stochastic Environmental Research and Risk Assessment, 2023, 37 : 4307 - 4321
  • [26] Exploring the efficacy of machine learning models for predicting soil radon exhalation rates
    Al-Shboul, Khaled F. F.
    Almasabha, Ghassan
    Shehadeh, Ali
    Alshboul, Odey
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2023, 37 (11) : 4307 - 4321
  • [27] Predicting Employability of Candidates: Comparative Study of Different Machine Learning Models
    Hitharth, K. B. Sai
    Dhanya, N. M.
    PROCEEDINGS OF EMERGING TRENDS AND TECHNOLOGIES ON INTELLIGENT SYSTEMS (ETTIS 2021), 2022, 1371 : 179 - 190
  • [28] Learning Rates of Support Vector Machine Classifiers with Data Dependent Hypothesis Spaces
    Sheng, Bao-Huai
    Ye, Pei-Xin
    JOURNAL OF COMPUTERS, 2012, 7 (01) : 252 - 257
  • [29] Comparing the Performance of 17 Machine Learning Models in Predicting Human Population Growth of Countries
    Otoom, Mohammad Mahmood
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (01) : 220 - 225
  • [30] Comparing Machine Learning Algorithms And Regression Models For Predicting Functional Outcome In The Stratis Registry
    Jumaa, Mouhammad A.
    Zoghi, Zeinab
    Zaidi, Syed F.
    Mueller-Kronast, Nils
    Zaidat, Osama
    Castonguay, Alicia C.
    STROKE, 2022, 53