An ensemble forecast model of dengue in Guangzhou, China using climate and social media surveillance data

Cited by: 25
Authors
Guo, Pi [1]
Zhang, Qin [2]
Chen, Yuliang [1]
Xiao, Jianpeng [3]
He, Jianfeng [4]
Zhang, Yonghui [4]
Wang, Li [5]
Liu, Tao [3]
Ma, Wenjun [3]
Affiliations
[1] Shantou Univ, Med Coll, Dept Prevent Med, 22 Xinling Rd, Shantou 515041, Peoples R China
[2] Shantou Univ, Med Coll, Canc Hosp, Good Clin Practice Off, Shantou 515041, Peoples R China
[3] Guangdong Prov Ctr Dis Control & Prevent, Guangdong Prov Inst Publ Hlth, Guangzhou 511430, Guangdong, Peoples R China
[4] Guangdong Prov Ctr Dis Control & Prevent, Guangzhou 511430, Guangdong, Peoples R China
[5] Chinese Acad Sci, Shenzhen Univ Town, Shenzhen Inst Adv Technol, 1068 Xueyuan Ave, Shenzhen 518055, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Dengue; Ensemble model; Forecast; Outbreak; Real time; Selection
DOI
10.1016/j.scitotenv.2018.08.044
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Background: China experienced an unprecedented outbreak of dengue in 2014, when the number of dengue cases reached its highest level in 25 years. Official case count data are released with a significant delay, and our ability to track the timing and magnitude of local dengue outbreaks in a timely manner remains limited.
Material and methods: We developed an ensemble penalized regression algorithm (EPRA) for initializing near-real time forecasts of the dengue epidemic trajectory by integrating different penalties (LASSO, Ridge, Elastic Net, SCAD and MCP) with iterative sampling and model averaging. Multiple streams of near-real time data were used, including dengue-related Baidu searches, Sina Weibo posts, and climatic conditions, together with historical dengue incidence. We compared the predictive power of the EPRA with that of the alternatives, penalized regression models using single penalties, in retrospectively forecasting weekly dengue incidence and detecting outbreak occurrence, defined using different cutoffs, during 2011-2016 in Guangzhou, south China.
Results: The EPRA showed the best or at least comparable performance for 1- and 2-week ahead out-of-sample forecasts and leave-one-out cross validation forecasts. The findings indicate that skillful near-real time forecasts of dengue, with confidence in those predictions, can be made. For detecting dengue outbreaks, the EPRA predicted periods of high dengue incidence more accurately than the alternatives.
Conclusion: This study developed a statistically rigorous approach for near-real time forecasting of dengue in China. The EPRA provides skillful forecasts and can serve as a timely and complementary way to assess dengue dynamics, which will help in designing interventions to mitigate dengue transmission.
(c) 2018 Elsevier B.V. All rights reserved.
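The idea of combining several penalties with iterative sampling and model averaging can be illustrated with a minimal sketch. This is not the authors' EPRA implementation: it covers only the LASSO, Ridge and Elastic Net penalties (SCAD and MCP are omitted), it uses bootstrap resampling as one plausible reading of "iteratively sampling", and the function names, penalty strengths and synthetic predictors are all assumptions made for illustration.

```python
import numpy as np

def soft_threshold(z, t):
    """Soft-thresholding operator used by L1-penalized coordinate descent."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def elastic_net_cd(X, y, lam, alpha, n_iter=100):
    """Coordinate descent for
    (1/2n)||y - Xb||^2 + lam * (alpha*||b||_1 + (1-alpha)/2 * ||b||_2^2).
    alpha=1 is LASSO, alpha=0 is Ridge, values in between are Elastic Net."""
    n, p = X.shape
    beta = np.zeros(p)
    col_sq = (X ** 2).sum(axis=0) / n
    resid = y.copy()  # equals y - X @ beta while beta is all zeros
    for _ in range(n_iter):
        for j in range(p):
            rho = X[:, j] @ resid / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam * alpha) / (col_sq[j] + lam * (1 - alpha) + 1e-12)
            resid += X[:, j] * (beta[j] - new)  # keep the residual in sync
            beta[j] = new
    return beta

def fit_penalized(X, y, lam, alpha):
    """Standardize predictors, fit, and map coefficients back to the raw scale."""
    x_mean, x_std = X.mean(axis=0), X.std(axis=0) + 1e-12
    y_mean = y.mean()
    beta = elastic_net_cd((X - x_mean) / x_std, y - y_mean, lam, alpha)
    coef = beta / x_std
    return coef, y_mean - x_mean @ coef

def ensemble_forecast(X_train, y_train, X_new, penalties, n_boot=30, seed=0):
    """Average predictions over bootstrap resamples and penalty settings."""
    rng = np.random.default_rng(seed)
    n = len(y_train)
    preds = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)  # resample weeks with replacement
        for lam, alpha in penalties:
            coef, intercept = fit_penalized(X_train[idx], y_train[idx], lam, alpha)
            preds.append(X_new @ coef + intercept)
    preds = np.array(preds)
    return preds.mean(axis=0), np.percentile(preds, [2.5, 97.5], axis=0)

# Synthetic stand-ins for search volume, social media posts, temperature
# and lagged incidence (hypothetical data, not from the study).
rng = np.random.default_rng(42)
X = rng.normal(size=(120, 4))
y = 2.0 * X[:, 0] + 1.0 * X[:, 3] + rng.normal(scale=0.1, size=120)

penalties = [(0.05, 1.0), (0.05, 0.0), (0.05, 0.5)]  # LASSO, Ridge, Elastic Net
mean_pred, interval = ensemble_forecast(X[:100], y[:100], X[100:], penalties)
```

Here `mean_pred` is the model-averaged forecast for the 20 held-out rows, and `interval` holds the 2.5th and 97.5th percentiles of the individual model predictions, a rough indication of spread across resamples and penalties rather than a formal prediction interval.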
Pages: 752-762
Number of pages: 11