Flexible Bayesian Ensemble Machine Learning Framework for Predicting Local Ozone Concentrations

被引:36
作者
Ren, Xiang [1 ,2 ]
Mi, Zhongyuan [1 ,3 ]
Cai, Ting [1 ]
Nolte, Christopher G. [4 ]
Georgopoulos, Panos G. [1 ,2 ,3 ,5 ]
机构
[1] Rutgers State Univ, Environm & Occupat Hlth Sci Inst EOHSI, Piscataway, NJ 08854 USA
[2] Rutgers State Univ, Dept Chem & Biochem Engn, Piscataway, NJ 08854 USA
[3] Rutgers State Univ, Dept Environm Sci, New Brunswick, NJ 08901 USA
[4] US EPA, Ctr Environm Measurement & Modeling, Res Triangle Pk, NC 27711 USA
[5] Rutgers Sch Publ Hlth, Dept Environm & Occupat Hlth & Justice, Piscataway, NJ 08854 USA
关键词
ozone; interpretable machine learning; data fusion; exposure assessment; environmental and climate justice; USE REGRESSION-MODELS; EXPOSURE ASSESSMENT; PM2.5; RESOLUTION; CHINA;
D O I
10.1021/acs.est.1c04076
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
3D-grid-based chemical transport models, such as the Community Multiscale Air Quality (CMAQ) modeling system, have been widely used for predicting concentrations of ambient air pollutants. However, typical horizontal resolutions of nationwide CMAQ simulations (12 X 12 km(2)) cannot capture local-scale gradients for accurately assessing human exposures and environmental justice disparities. In this study, a Bayesian ensemble machine learning (BEML) framework, which integrates 13 learning algorithms, was developed for downscaling CMAQ estimates of ozone daily maximum 8 h averages to the census tract level, across the contiguous US, and was demonstrated for 2011. Three-stage hyper-parameter tuning and targeted validations were designed to ensure the ensemble model's ability to interpolate, extrapolate, and capture concentration peaks. The Shapley value metric from coalitional game theory was applied to interpret the drivers of subgrid gradients. The flexibility (transferability) of the 2011-trained BEML model was further tested by evaluating its ability to estimate fine-scale concentrations for other years (2012-2017) without retraining. To demonstrate the feasibility of using the BEML approach to strictly "data-limited" situations, the model was applied to downscale CMAQ outputs for a future-year scenario-based simulation that considers effects of variations in meteorology associated with climate change.
引用
收藏
页码:3871 / 3883
页数:13
相关论文
共 54 条
[21]   Comparison of multiple PM2.5 exposure products for estimating health benefits of emission controls over New York State, USA [J].
Jin, Xiaomeng ;
Fiore, Arlene M. ;
Civerolo, Kevin ;
Bi, Jianzhao ;
Liu, Yang ;
van Donkelaar, Aaron ;
Martin, Randall V. ;
Al-Hamdan, Mohammad ;
Zhang, Yuqiang ;
Insaf, Tabassum Z. ;
Kioumourtzoglou, Marianthi-Anna ;
He, Mike Z. ;
Kinney, Patrick L. .
ENVIRONMENTAL RESEARCH LETTERS, 2019, 14 (08)
[22]   Examining PM2.5 concentrations and exposure using multiple models [J].
Kelly, James T. ;
Jang, Carey ;
Timin, Brian ;
Di, Qian ;
Schwartz, Joel ;
Liu, Yang ;
van Donkelaar, Aaron ;
Martin, Randall V. ;
Berrocal, Veronica ;
Bell, Michelle L. .
ENVIRONMENTAL RESEARCH, 2021, 196
[23]   Long-term nitrogen dioxide exposure assessment using back-extrapolation of satellite-based land-use regression models for Australia [J].
Knibbs, Luke D. ;
Coorey, Craig. P. ;
Bechle, Matthew J. ;
Marshall, Julian D. ;
Hewson, Michael G. ;
Jalaludin, Bin ;
Morgan, Geoff G. ;
Barnett, Adrian G. .
ENVIRONMENTAL RESEARCH, 2018, 163 :16-25
[24]  
Koller D., 2009, Probabilistic graphical models: principles and techniques
[25]   Constrained Mixed-Effect Models with Ensemble Learning for Prediction of Nitrogen Oxides Concentrations at High Spatiotemporal Resolution [J].
Li, Lianfa ;
Lurmann, Fred ;
Habre, Rima ;
Urman, Robert ;
Rappaport, Edward ;
Ritz, Beate ;
Chen, Jiu-Chiuan ;
Gilliland, Frank D. ;
Wu, Jun .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2017, 51 (17) :9920-9929
[26]   Spatiotemporal distributions of surface ozone levels in China from 2005 to 2017: A machine learning approach [J].
Liu, Riyang ;
Ma, Zongwei ;
Liu, Yang ;
Shao, Yanchuan ;
Zhao, Wei ;
Bi, Jun .
ENVIRONMENT INTERNATIONAL, 2020, 142
[27]   National Empirical Models of Air Pollution Using Microscale Measures of the Urban Environment [J].
Lu, Tianjun ;
Marshall, Julian D. ;
Zhang, Wenwen ;
Hystad, Perry ;
Kim, Sun-Young ;
Bechle, Matthew J. ;
Demuzere, Matthias ;
Hankey, Steve .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2021, 55 (22) :15519-15530
[28]  
Lundberg SM, 2017, ADV NEUR IN, V30
[29]   From local explanations to global understanding with explainable AI for trees [J].
Lundberg, Scott M. ;
Erion, Gabriel ;
Chen, Hugh ;
DeGrave, Alex ;
Prutkin, Jordan M. ;
Nair, Bala ;
Katz, Ronit ;
Himmelfarb, Jonathan ;
Bansal, Nisha ;
Lee, Su-In .
NATURE MACHINE INTELLIGENCE, 2020, 2 (01) :56-67
[30]   Fusion Method Combining Ground-Level Observations with Chemical Transport Model Predictions Using an Ensemble Deep Learning Framework: Application in China to Estimate Spatiotemporally-Resolved PM2.5 Exposure Fields in 2014-2017 [J].
Lyu, Baolei ;
Hu, Yongtao ;
Zhang, Wenxian ;
Du, Yunsong ;
Luo, Bin ;
Sun, Xiaoling ;
Sun, Zhe ;
Deng, Zhu ;
Wang, Xiaojiang ;
Liu, Jun ;
Wang, Xuesong ;
Russell, Armistead G. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2019, 53 (13) :7306-7315