Comparison of ARIMA and Random Forest time series models for prediction of avian influenza H5N1 outbreaks

被引:223
作者
Kane, Michael J. [1 ,2 ]
Price, Natalie [3 ]
Scotch, Matthew [4 ,5 ]
Rabinowitz, Peter [6 ]
机构
[1] Yale Univ, Yale Ctr Analyt Sci, New Haven, CT 06520 USA
[2] Yale Univ, Dept Biostat, New Haven, CT USA
[3] Harvard Univ, Sch Publ Hlth, Cambridge, MA 02138 USA
[4] Arizona State Univ, Coll Hlth Solut, Dept Biomed Inform, Tempe, AZ USA
[5] Arizona State Univ, Ctr Environm Secur, Biodesign Inst & Secur & Def Syst Initiat, Tempe, AZ USA
[6] Univ Washington, Environm & Occupat Hlth Sci, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
MORTALITY; POLICY;
D O I
10.1186/1471-2105-15-276
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Time series models can play an important role in disease prediction. Incidence data can be used to predict the future occurrence of disease events. Developments in modeling approaches provide an opportunity to compare different time series models for predictive power. Results: We applied ARIMA and Random Forest time series models to incidence data of outbreaks of highly pathogenic avian influenza (H5N1) in Egypt, available through the online EMPRES-I system. We found that the Random Forest model outperformed the ARIMA model in predictive ability. Furthermore, we found that the Random Forest model is effective for predicting outbreaks of H5N1 in Egypt. Conclusions: Random Forest time series modeling provides enhanced predictive ability over existing time series models for the prediction of infectious disease outbreaks. This result, along with those showing the concordance between bird and human outbreaks (Rabinowitz et al. 2012), provides a new approach to predicting these dangerous outbreaks in bird populations based on existing, freely available data. Our analysis uncovers the time-series structure of outbreak severity for highly pathogenic avain influenza (H5N1) in Egypt.
引用
收藏
页数:9
相关论文
共 21 条
[1]   GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215
[4]   Modeling seasonal leptospirosis transmission and its association with rainfall and temperature in Thailand using time-series and ARIMAX analyses [J].
Chadsuthi, Sudarat ;
Modchang, Charin ;
Lenbury, Yongwimon ;
Iamsirithaworn, Sopon ;
Triampo, Wannapong .
ASIAN PACIFIC JOURNAL OF TROPICAL MEDICINE, 2012, 5 (07) :539-546
[5]   Monitoring mortality as an indicator of influenza in Catalonia, Spain [J].
Dominguez, A ;
Munoz, P ;
Martinez, A ;
Orcau, A .
JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 1996, 50 (03) :293-298
[6]  
Food and Agriculture Organization of the United Nations, EMPRES 1 GLOB AN DIS
[7]   The role of El Nino southern oscillation (ENSO) on variations of monthly Plasmodium falciparum malaria cases at the cayenne general hospital, 1996-2009, French Guiana [J].
Hanf, Matthieu ;
Adenis, Antoine ;
Nacher, Mathieu ;
Carme, Bernard .
MALARIA JOURNAL, 2011, 10
[8]   A global model of avian influenza prediction in wild birds: the importance of northern regions [J].
Herrick, Keiko A. ;
Huettmann, Falk ;
Lindgren, Michael A. .
VETERINARY RESEARCH, 2013, 44
[9]  
Hyndman R.J., 2012, forecast: Forecasting functions for time series and linear models
[10]   Zoonotic Transmission of Avian Influenza Virus (H5N1), Egypt, 2006-2009 [J].
Kandeel, Amr ;
Manoncourt, Serge ;
Abd el Kareem, Eman ;
Ahmed, Abdel-Nasser Mohamed ;
El-Refaie, Samir ;
Essmat, Hala ;
Tjaden, Jeffrey ;
de Mattos, Cecilia C. ;
Earhart, Kenneth C. ;
Marfin, Anthony A. ;
El-Sayed, Nasr .
EMERGING INFECTIOUS DISEASES, 2010, 16 (07) :1101-1107