Causal Drivers of Land-Atmosphere Carbon Fluxes From Machine Learning Models and Data

被引:1
作者
Farahani, Mozhgan A. [1 ]
Goodwell, Allison E. [1 ,2 ]
机构
[1] Univ Colorado Denver, Dept Civil Engn, Denver, CO 80204 USA
[2] Univ Illinois, Prairie Res Inst, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
information theory; model evaluation; functional performance; carbon flux; machine learning; causal analysis; GROSS PRIMARY PRODUCTION; NET ECOSYSTEM EXCHANGE; NEURAL-NETWORKS; WATER FLUXES; BENCHMARKING; RESPIRATION; DIAGNOSTICS; DIOXIDE; ERROR;
D O I
10.1029/2023JG007815
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Interactions among atmospheric, root-soil, and vegetation processes drive carbon dioxide fluxes (Fc) from land to atmosphere. Eddy covariance measurements are commonly used to measure Fc at sub-daily timescales and validate process-based and data-driven models. However, these validations do not reveal process interactions, thresholds, and key differences in how models replicate them. We use information theory-based measures to explore multivariate information flow pathways from forcing data to observed and modeled hourly Fc, using flux tower data sets in the Midwestern U.S. in intensively managed corn-soybean landscapes. We compare multiple linear regressions, long-short term memory, and random forests (RF), and evaluate how different model structures use information from combinations of sources to predict Fc. We extend a framework for model predictive and functional performance, which examines a suite of dependencies from all forcing variables to the observed or modeled target. Of the three model types, RF exhibited the highest functional and predictive performance, correctly capturing strong dependencies between radiation and temperature variables with Fc. Regionally trained models demonstrate lower predictive but higher functional performance compared to site-specific models, suggesting superior reproduction of observed relationships at the expense of predictive accuracy. This study shows that some metrics of predictive performance encapsulate functional behaviors better than others, highlighting the need for multiple metrics of both types. This study improves our understanding of carbon fluxes in an intensively managed landscape, and more generally provides insight into how model structures and forcing variables translate to interactions that are well versus poorly captured in models. In an agricultural landscape, exchanges of carbon dioxide between the land and atmosphere occur due to photosynthesis and respiration and depend on weather, soil, and vegetation conditions. Traditional model performance metrics focus on the relationship between observed and modeled outputs, while functional performance considers the relationships between interacting inputs and outputs. We compare several performance measures for three different machine learning (ML) models that simulate sub-daily carbon fluxes. We look at how drivers such as solar radiation, soil moisture, temperature, humidity, and rainfall provide information to carbon fluxes, and whether different ML models also capture these interactions. In other words:Air, soil, and plants drive carbon's upward path, Models are detectives, interpreting their math. With information theory, we map data's travel courses, To see how models find or miss carbon's causal sources. Information theory measures describe individual and joint causal relationships in observed versus modeled vertical carbon dioxide fluxes Three machine learning models overestimate unique information from sources at the expense of synergistic, or joint information Regionally trained models have improved functional but not predictive performances, indicating a trade-off
引用
收藏
页数:23
相关论文
共 92 条
  • [1] Spatiotemporal patterns of terrestrial gross primary production: A review
    Anav, Alessandro
    Friedlingstein, Pierre
    Beer, Christian
    Ciais, Philippe
    Harper, Anna
    Jones, Chris
    Murray-Tortarolo, Guillermo
    Papale, Dario
    Parazoo, Nicholas C.
    Peylin, Philippe
    Piao, Shilong
    Sitch, Stephen
    Viovy, Nicolas
    Wiltshire, Andy
    Zhao, Maosheng
    [J]. REVIEWS OF GEOPHYSICS, 2015, 53 (03) : 785 - 818
  • [2] Statistical Mechanics and Information-Theoretic Perspectives on Complexity in the Earth System
    Balasis, Georgios
    Donner, Reik V.
    Potirakis, Stelios M.
    Runge, Jakob
    Papadimitriou, Constantinos
    Daglis, Ioannis A.
    Eftaxias, Konstantinos
    Kurths, Juergen
    [J]. ENTROPY, 2013, 15 (11) : 4844 - 4888
  • [3] Exploration of synergistic and redundant information sharing in static and dynamical Gaussian systems
    Barrett, Adam B.
    [J]. PHYSICAL REVIEW E, 2015, 91 (05):
  • [4] Parsimony vs predictive and functional performance of three stomatal optimization principles in a big-leaf framework
    Bassiouni, Maoya
    Vico, Giulia
    [J]. NEW PHYTOLOGIST, 2021, 231 (02) : 586 - 600
  • [5] Quantifying Process Connectivity With Transfer Entropy in Hydrologic Models
    Bennett, Andrew
    Nijssen, Bart
    Ou, Gengxin
    Clark, Martyn
    Nearing, Grey
    [J]. WATER RESOURCES RESEARCH, 2019, 55 (06) : 4613 - 4629
  • [6] The Plumbing of Land Surface Models: Benchmarking Model Performance
    Best, M. J.
    Abramowitz, G.
    Johnson, H. R.
    Pitman, A. J.
    Balsamo, G.
    Boone, A.
    Cuntz, M.
    Decharme, B.
    Dirmeyer, P. A.
    Dong, J.
    Ek, M.
    Guo, Z.
    Haverd, V.
    Van den Hurk, B. J. J.
    Nearing, G. S.
    Pak, B.
    Peters-Lidard, C.
    Santanello, J. A., Jr.
    Stevens, L.
    Vuichard, N.
    [J]. JOURNAL OF HYDROMETEOROLOGY, 2015, 16 (03) : 1425 - 1442
  • [7] Introduction to Focus Issue: Causation inference and information flow in dynamical systems: Theory and applications
    Bollt, Erik M.
    Sun, Jie
    Runge, Jakob
    [J]. CHAOS, 2018, 28 (07)
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Net ecosystem exchange of carbon dioxide in a temperate poor fen: a comparison of automated and manual chamber techniques
    Burrows, EH
    Bubier, JL
    Mosedale, A
    Cobb, GW
    Crill, PM
    [J]. BIOGEOCHEMISTRY, 2005, 76 (01) : 21 - 45
  • [10] Modeling Canopy Carbon and Water Fluxes Using a Multilayered Model over a Temperate Meadow in Inner Mongolia
    Chen, Nina
    Wang, Anzhi
    An, Juan
    Zhang, Yushu
    Ji, Ruipeng
    Jia, Qingyu
    Zhao, Ziqi
    Guan, Dexin
    [J]. INTERNATIONAL JOURNAL OF PLANT PRODUCTION, 2020, 14 (01) : 141 - 154