Analysis and prediction of hand, foot and mouth disease incidence in China using Random Forest and XGBoost

被引:23
作者
Meng, Delin [1 ]
Xu, Jun [2 ]
Zhao, Jijun [1 ]
机构
[1] Qingdao Univ, Complex Sci Inst, Qingdao, Shandong, Peoples R China
[2] Chinese Acad Sci, Inst Geog Sci & Nat Resources Res, State Key Lab Resources & Environm Informat Syst, Beijing, Peoples R China
来源
PLOS ONE | 2021年 / 16卷 / 12期
关键词
DYNAMICS; TRANSMISSION; INFECTIONS; CITY;
D O I
10.1371/journal.pone.0261629
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Hand, foot and mouth disease (HFMD) is an increasingly serious public health problem, and it has caused an outbreak in China every year since 2008. Predicting the incidence of HFMD and analyzing its influential factors are of great significance to its prevention. Now, machine learning has shown advantages in infectious disease models, but there are few studies on HFMD incidence based on machine learning that cover all the provinces in mainland China. In this study, we proposed two different machine learning algorithms, Random Forest and eXtreme Gradient Boosting (XGBoost), to perform our analysis and prediction. We first used Random Forest to examine the association between HFMD incidence and potential influential factors for 31 provinces in mainland China. Next, we established Random Forest and XGBoost prediction models using meteorological and social factors as the predictors. Finally, we applied our prediction models in four different regions of mainland China and evaluated the performance of them. Our results show that: 1) Meteorological factors and social factors jointly affect the incidence of HFMD in mainland China. Average temperature and population density are the two most significant influential factors; 2) Population flux has different delayed effect in affecting HFMD incidence in different regions. From a national perspective, the model using population flux data delayed for one month has better prediction performance; 3) The prediction capability of XGBoost model was better than that of Random Forest model from the overall perspective. XGBoost model is more suitable for predicting the incidence of HFMD in mainland China.
引用
收藏
页数:16
相关论文
共 42 条
  • [1] Integration of multi-omics data for prediction of phenotypic traits using random forest
    Acharjee, Animesh
    Kloosterman, Bjorn
    Visser, Richard G. F.
    Maliepaard, Chris
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [2] Comparison of ARIMA model and XGBoost model for prediction of human brucellosis in mainland China: a time-series study
    Alim, Mirxat
    Ye, Guo-Hua
    Guan, Peng
    Huang, De-Sheng
    Zhou, Bao-Sen
    Wu, Wei
    [J]. BMJ OPEN, 2020, 10 (12):
  • [3] A Multi-Stage Machine Learning Approach to Predict Dengue Incidence: A Case Study in Mexico
    Appice, Annalisa
    Gel, Yulia R.
    Iliev, Iliyan
    Lyubchich, Vyacheslav
    Malerba, Donato
    [J]. IEEE ACCESS, 2020, 8 : 52713 - 52725
  • [4] Using an autologistic regression model to identify spatial risk factors and spatial risk patterns of hand, foot and mouth disease (HFMD) in Mainland China
    Bo, Yan-Chen
    Song, Chao
    Wang, Jin-Feng
    Li, Xiao-Wen
    [J]. BMC PUBLIC HEALTH, 2014, 14
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Machine learning methods reveal the temporal pattern of dengue incidence using meteorological factors in metropolitan Manila, Philippines
    Carvajal, Thaddeus M.
    Viacrusis, Katherine M.
    Hernandez, Lara Fides T.
    Ho, Howell T.
    Amalin, Divina M.
    Watanabe, Kozo
    [J]. BMC INFECTIOUS DISEASES, 2018, 18
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] Predicting the hand, foot, and mouth disease incidence using search engine query data and climate variables: an ecological study in Guangdong, China
    Du, Zhicheng
    Xu, Lin
    Zhang, Wangjian
    Zhang, Dingmei
    Yu, Shicheng
    Hao, Yuantao
    [J]. BMJ OPEN, 2017, 7 (10):
  • [9] Duan CX, 2019, EPIDEMIOL INFECT, V147, DOI [10.1017/S0950268818003035, 10.1017/s0950268818003035]
  • [10] Time Series Analysis of Hand-Foot-Mouth Disease Hospitalization in Zhengzhou: Establishment of Forecasting Models Using Climate Variables as Predictors
    Feng, Huifen
    Duan, Guangcai
    Zhang, Rongguang
    Zhang, Weidong
    [J]. PLOS ONE, 2014, 9 (01):