Spatiotemporal models of dengue epidemiology in the Philippines: Integrating remote sensing and interpretable machine learning

被引:2
作者
Buebos-Esteve, Don Enrico [1 ]
Dagamac, Nikki Heherson A. [1 ,2 ,3 ]
机构
[1] Univ Santo Tomas, Res Ctr Nat & Appl Sci, Initiat Conservat Landscape Ecol Bioprospecting &, Manila 1008, Philippines
[2] Univ Santo Tomas, Dept Biol Sci, Coll Sci, Manila 1008, Philippines
[3] Univ Santo Tomas, Grad Sch, Manila 1008, Philippines
关键词
Dengue virus (DENV); Explainable artificial intelligence; One health; Spatiotemporal epidemiology; Vector -borne disease; ARTIFICIAL-INTELLIGENCE; OPTIMIZATION; SELECTION; MANILA; FUTURE; RISK;
D O I
10.1016/j.actatropica.2024.107225
中图分类号
R38 [医学寄生虫学]; Q [生物科学];
学科分类号
07 ; 0710 ; 09 ; 100103 ;
摘要
Previous dengue epidemiological analyses have been limited in spatiotemporal extent or covariate dimensions, the latter neglecting the multifactorial nature of dengue. These constraints, caused by rigid and traditional statistical tools which collapse amidst 'Big Data', prompt interpretable machine-learning (iML) approaches. Predicting dengue incidence and mortality in the Philippines, a data-limited yet high-burden country, the mlr3 universe of R packages was used to build and optimize ML models based on remotely sensed provincial and dekadal 3 NDVI and 9 rainfall features from 2016 to 2020. Between two tasks, models differ across four random forest-based learners and two clustering strategies. Among 16 candidates, rfsrc-year-case and ranger-year-death significantly perform best for predicting dengue incidence and mortality, respectively. Therefore, temporal clustering yields the best models, reflective of dengue seasonality. The two best models were subjected to tripartite global exploratory model analyses, which encompass model-agnostic post-hoc methods such as Permutation Feature Importance (PFI) and Accumulated Local Effects (ALE). PFI reveals that the models differ in their important explanatory aspect, rainfall for rfsrc-year-case and NDVI for ranger-year-death, among which longterm average (lta) features are most relevant. Trend-wise, ALE reveals that average incidence predictions are positively associated with 'Rain.lta', reflective of dengue cases peaking during the wet season. In contrast, those for mortality are negatively associated with 'NDVI.lta', reflective of urban spaces driving dengue-related deaths. By technologically addressing the challenges of the human-animal-ecosystem interface, this study adheres to the One Digital Health paradigm operationalized under Sustainable Development Goals (SDGs). Leveraging data digitization and predictive modeling for epidemiological research paves SDG 3, which prioritizes holistic health and well-being.
引用
收藏
页数:14
相关论文
共 100 条
[1]  
Abaje I.B., 2014, Eur. Sci. J., V10
[2]  
Ahmad MA, 2018, ACM-BCB'18: PROCEEDINGS OF THE 2018 ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, P559, DOI [10.1145/3233547.3233667, 10.1109/ICHI.2018.00095]
[3]   Trends in dengue research in the Philippines: A systematic review [J].
An Agrupis, Kristal ;
Ylade, Michelle ;
Aldaba, Josephine ;
Lena Lopez, Anna ;
Deen, Jacqueline .
PLOS NEGLECTED TROPICAL DISEASES, 2019, 13 (04)
[4]   Visualizing the effects of predictor variables in black box supervised learning models [J].
Apley, Daniel W. ;
Zhu, Jingyu .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (04) :1059-1086
[5]   A Multi-Stage Machine Learning Approach to Predict Dengue Incidence: A Case Study in Mexico [J].
Appice, Annalisa ;
Gel, Yulia R. ;
Iliev, Iliyan ;
Lyubchich, Vyacheslav ;
Malerba, Donato .
IEEE ACCESS, 2020, 8 :52713-52725
[6]   Instance selection for regression by discretization [J].
Arnaiz-Gonzalez, Alvar ;
Diez-Pastor, Jose F. ;
Rodriguez, Juan J. ;
Ignacio Garcia-Osorio, Cesar .
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 54 :340-350
[7]   Ensemble modelling in descriptive epidemiology: burden of disease estimation [J].
Bannick, Marlena S. ;
McGaughey, Madeline ;
Flaxman, Abraham D. .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2020, 49 (06) :2065-2073
[8]   One Digital Health: A Unified Framework for Future Health Ecosystems [J].
Benis, Arriel ;
Tamburis, Oscar ;
Chronaki, Catherine ;
Moen, Anne .
JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (02)
[9]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[10]   What is Machine Learning? A Primer for the Epidemiologist [J].
Bi, Qifang ;
Goodman, Katherine E. ;
Kaminsky, Joshua ;
Lessler, Justin .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2019, 188 (12) :2222-2239