Improving the prediction of an atmospheric chemistry transport model using gradient-boosted regression trees

被引:44
作者
Ivatt, Peter D. [1 ,2 ]
Evans, Mathew J. [1 ,2 ]
机构
[1] Univ York, Dept Chem, Wolfson Atmospher Chem Labs, York YO10 5DD, N Yorkshire, England
[2] Univ York, Dept Chem, Natl Ctr Atmospher Sci, York YO10 5DD, N Yorkshire, England
关键词
SURFACE OZONE; TROPOSPHERIC CHEMISTRY; EMISSIONS; BIAS; PERFORMANCE; METEOROLOGY; INVENTORY; FRAMEWORK; AIRCRAFT; IMPACT;
D O I
10.5194/acp-20-8063-2020
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Predictions from process-based models of environmental systems are biased, due to uncertainties in their inputs and parameterizations, reducing their utility. We develop a predictor for the bias in tropospheric ozone (O-3, a key pollutant) calculated by an atmospheric chemistry transport model (GEOS-Chem), based on outputs from the model and observations of ozone from both the surface (EPA, EMEP, and GAW) and the ozone-sonde networks. We train a gradient-boosted decision tree algorithm (XGBoost) to predict model bias (model divided by observation), with model and observational data for 2010-2015, and then we test the approach using the years 2016-2017. We show that the bias-corrected model performs considerably better than the uncorrected model. The root-mean-square error is reduced from 16.2 to 7.5 ppb, the normalized mean bias is reduced from 0.28 to -0.04, and Pearson's R is increased from 0.48 to 0.84. Comparisons with observations from the NASA ATom flights (which were not included in the training) also show improvements but to a smaller extent, reducing the rootmean-square error (RMSE) from 12.1 to 10.5 ppb, reducing the normalized mean bias (NMB) from 0.08 to 0.06, and increasing Pearson's R from 0.76 to 0.79. We attribute the smaller improvements to the lack of routine observational constraints for much of the remote troposphere. We show that the method is robust to variations in the volume of training data, with approximately a year of data needed to produce useful performance. Data denial experiments (removing observational sites from the algorithm training) show that information from one location (for example Europe) can reduce the model bias over other locations (for example North America) which might provide insights into the processes controlling the model bias. We explore the choice of predictor (bias prediction versus direct prediction) and conclude both may have utility. We conclude that combining machine learning approaches with process-based models may provide a useful tool for improving these models.
引用
收藏
页码:8063 / 8082
页数:20
相关论文
共 58 条
  • [1] Machine Learning Predictions of a Multiresolution Climate Model Ensemble
    Anderson, Gemma J.
    Lucas, Donald D.
    [J]. GEOPHYSICAL RESEARCH LETTERS, 2018, 45 (09) : 4273 - 4280
  • [2] [Anonymous], 2017, INT GEOS CHEM USER C
  • [3] The quiet revolution of numerical weather prediction
    Bauer, Peter
    Thorpe, Alan
    Brunet, Gilbert
    [J]. NATURE, 2015, 525 (7567) : 47 - 55
  • [4] Bergstra J, 2012, J MACH LEARN RES, V13, P281
  • [5] Global modeling of tropospheric chemistry with assimilated meteorology: Model description and evaluation
    Bey, I
    Jacob, DJ
    Yantosca, RM
    Logan, JA
    Field, BD
    Fiore, AM
    Li, QB
    Liu, HGY
    Mickley, LJ
    Schultz, MG
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2001, 106 (D19) : 23073 - 23095
  • [6] Top-down induction of first-order logical decision trees
    Blockeel, H
    De Raedt, L
    [J]. ARTIFICIAL INTELLIGENCE, 1998, 101 (1-2) : 285 - 297
  • [7] Data assimilation in atmospheric chemistry models: current status and future prospects for coupled chemistry meteorology models
    Bocquet, M.
    Elbern, H.
    Eskes, H.
    Hirtl, M.
    Zabkar, R.
    Carmichael, G. R.
    Flemming, J.
    Inness, A.
    Pagowski, M.
    Perez Camano, J. L.
    Saide, P. E.
    San Jose, R.
    Sofiev, M.
    Vira, J.
    Baklanov, A.
    Carnevale, C.
    Grell, G.
    Seigneur, C.
    [J]. ATMOSPHERIC CHEMISTRY AND PHYSICS, 2015, 15 (10) : 5325 - 5358
  • [8] Spectral analysis of atmospheric composition: application to surface ozone model-measurement comparisons
    Bowdalo, Dene R.
    Evans, Mathew J.
    Sofen, Eric D.
    [J]. ATMOSPHERIC CHEMISTRY AND PHYSICS, 2016, 16 (13) : 8295 - 8308
  • [9] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [10] Cawley GC, 2010, J MACH LEARN RES, V11, P2079