Causally-Informed Deep Learning to Improve Climate Models and Projections

被引:18
作者
Iglesias-Suarez, Fernando [1 ]
Gentine, Pierre [2 ,3 ]
Solino-Fernandez, Breixo [1 ]
Beucler, Tom [4 ]
Pritchard, Michael [5 ,6 ]
Runge, Jakob [7 ,8 ]
Eyring, Veronika [1 ,9 ]
机构
[1] Deutsch Zentrum Luft & Raumfahrt eV DLR, Inst Atmospher Phys, Oberpfaffenhofen, Germany
[2] Columbia Univ, Ctr Learning Earth Artificial Intelligence & Phys, Dept Earth & Environm Engn, New York, NY USA
[3] Columbia Univ, Learning Earth Artificial Intelligence & Phys LEAP, Earth & Environm Engn, Earth & Environm Sci, New York, NY USA
[4] Univ Lausanne, Inst Earth Surface Dynam, Lausanne, Switzerland
[5] Univ Calif Irvine, Dept Earth Syst Sci, Irvine, CA USA
[6] NVIDIA Corp, Santa Clara, CA USA
[7] Deutsch Zentrum Luft & Raumfahrt eV DLR, Inst Data Sci, Jena, Germany
[8] Tech Univ Berlin, Inst Comp Engn & Microelect, Berlin, Germany
[9] Univ Bremen, Inst Environm Phys IUP, Bremen, Germany
基金
美国国家科学基金会; 欧洲研究理事会; 欧盟地平线“2020”;
关键词
climate modeling; causal discovery; deep learning; subgrid parameterization; convection; HIGH-RESOLUTION SIMULATION; CONVECTION; CIRCULATION; ATMOSPHERE; PARAMETERIZATION; TRANSITION; VERSIONS; SHALLOW;
D O I
10.1029/2023JD039202
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
Climate models are essential to understand and project climate change, yet long-standing biases and uncertainties in their projections remain. This is largely associated with the representation of subgrid-scale processes, particularly clouds and convection. Deep learning can learn these subgrid-scale processes from computationally expensive storm-resolving models while retaining many features at a fraction of computational cost. Yet, climate simulations with embedded neural network parameterizations are still challenging and highly depend on the deep learning solution. This is likely associated with spurious non-physical correlations learned by the neural networks due to the complexity of the physical dynamical system. Here, we show that the combination of causality with deep learning helps removing spurious correlations and optimizing the neural network algorithm. To resolve this, we apply a causal discovery method to unveil causal drivers in the set of input predictors of atmospheric subgrid-scale processes of a superparameterized climate model in which deep convection is explicitly resolved. The resulting causally-informed neural networks are coupled to the climate model, hence, replacing the superparameterization and radiation scheme. We show that the climate simulations with causally-informed neural network parameterizations retain many convection-related properties and accurately generate the climate of the original high-resolution climate model, while retaining similar generalization capabilities to unseen climates compared to the non-causal approach. The combination of causal discovery and deep learning is a new and promising approach that leads to stable and more trustworthy climate simulations and paves the way toward more physically-based causal deep learning approaches also in other scientific disciplines. Climate models have biases compared to observations that have been present for a long time because certain processes, like convection, are only approximated using simplified methods. Neural networks can better represent these processes, but often learn incorrect connections leading to unreliable results and climate model crashes. To solve this, we used a new method that informs neural networks with causal drivers, therefore, respecting the underlying physical processes. By doing so, we developed more reliable and trustworthy neural networks, allowing us to accurately represent the climate of the original high-resolution simulation on which these neural networks were trained. Causal discovery unveils causal drivers of subgrid-scale processes across climates The causally-informed hybrid model runs stably and generates a climate close to the original high-resolution simulation Spurious correlations are evident in the non-causal parameterization, leading to underestimate feature importance of physical drivers
引用
收藏
页数:16
相关论文
共 88 条
[1]   Reverse Engineering the Tropical Precipitation-Buoyancy Relationship [J].
Ahmed, Fiaz ;
Neelin, J. David .
JOURNAL OF THE ATMOSPHERIC SCIENCES, 2018, 75 (05) :1587-1608
[2]   Moist Static Energy Budget of MJO-like Disturbances in the Atmosphere of a Zonally Symmetric Aquaplanet [J].
Andersen, Joseph Allan ;
Kuang, Zhiming .
JOURNAL OF CLIMATE, 2012, 25 (08) :2782-2804
[3]   Reflections and projections on a decade of climate science [J].
不详 .
NATURE CLIMATE CHANGE, 2021, 11 (04) :279-285
[4]   Global-scale convective aggregation: Implications for the Madden-Julian Oscillation [J].
Arnold, Nathan P. ;
Randall, David A. .
JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2015, 7 (04) :1499-1518
[5]   Structure of the Madden-Julian Oscillation in the Superparameterized CAM [J].
Benedict, James J. ;
Randall, David A. .
JOURNAL OF THE ATMOSPHERIC SCIENCES, 2009, 66 (11) :3277-3296
[6]  
Beucler T, 2024, Arxiv, DOI [arXiv:2112.08440, DOI 10.48550/ARXIV.2112.08440]
[7]   Quantifying Progress Across Different CMIP Phases With the ESMValTool [J].
Bock, L. ;
Lauer, A. ;
Schlund, M. ;
Barreiro, M. ;
Bellouin, N. ;
Jones, C. ;
Meehl, G. A. ;
Predoi, V. ;
Roberts, M. J. ;
Eyring, V. .
JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2020, 125 (21)
[8]   Clouds, circulation and climate sensitivity [J].
Bony, Sandrine ;
Stevens, Bjorn ;
Frierson, Dargan M. W. ;
Jakob, Christian ;
Kageyama, Masa ;
Pincus, Robert ;
Shepherd, Theodore G. ;
Sherwood, Steven C. ;
Siebesma, A. Pier ;
Sobel, Adam H. ;
Watanabe, Masahiro ;
Webb, Mark J. .
NATURE GEOSCIENCE, 2015, 8 (04) :261-268
[9]   Interpreting and Stabilizing Machine-Learning Parametrizations of Convection [J].
Brenowitz, Noah D. ;
Beucler, Tom ;
Pritchard, Michael ;
Bretherton, Christopher S. .
JOURNAL OF THE ATMOSPHERIC SCIENCES, 2020, 77 (12) :4357-4375
[10]   Spatially Extended Tests of a Neural Network Parametrization Trained by Coarse-Graining [J].
Brenowitz, Noah D. ;
Bretherton, Christopher S. .
JOURNAL OF ADVANCES IN MODELING EARTH SYSTEMS, 2019, 11 (08) :2728-2744