Physics-guided machine learning from simulated data with different physical parameters

被引:17
作者
Chen, Shengyu [1 ]
Kalanat, Nasrin [1 ]
Xie, Yiqun [3 ]
Li, Sheng [4 ]
Zwart, Jacob A. [5 ]
Sadler, Jeffrey M. [5 ,7 ]
Appling, Alison P. [5 ]
Oliver, Samantha K. [6 ]
Read, Jordan S. [5 ,8 ]
Jia, Xiaowei [2 ]
机构
[1] Univ Pittsburgh, Comp Sci, Pittsburgh, PA USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15260 USA
[3] Univ Maryland, Geospatial Informat Sci, College Pk, MD USA
[4] Univ Virginia, Sch Data Sci, Charlottesville, VA USA
[5] US Geol Survey, Reston, VA USA
[6] US Geol Survey, Upper Midwest Water Sci Ctr, Reston, VA USA
[7] Oklahoma State Univ, Stillwater, OK USA
[8] Consortium Univ Advancement Hydrol Sci Inc, Arlington, MA USA
基金
美国国家科学基金会;
关键词
Physics-guided machine learning; Spatio-temporal data; Deep learning; Freshwater science; Stream networks; Simulated data; DEBATES-THE FUTURE; HYDROLOGICAL SCIENCES; COMMON PATH; NEURAL-NETWORKS; CLIMATE-CHANGE; BIG DATA; MODELS; SIMILARITY;
D O I
10.1007/s10115-023-01864-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Physics-based models are widely used to study dynamical systems in a variety of scientific and engineering problems. However, these models are necessarily approximations of reality due to incomplete knowledge or excessive complexity in modeling underlying processes. As a result, they often produce biased simulations due to inaccurate parameterizations or approximations used to represent the true physics. In this paper, we aim to build a new physics-guided machine learning framework to monitor dynamical systems. The idea is to use advanced machine learning model to extract complex spatio-temporal data patterns while also incorporating general scientific knowledge embodied in simulated data generated by the physics-based model. To handle the bias in simulated data caused by imperfect parameterization, we propose to extract general physical relations jointly from multiple sets of simulations generated by a physics-based model under different physical parameters. In particular, we develop a spatio-temporal network architecture that uses its gating variables to capture the variation of physical parameters. We initialize this model using a pre-training strategy that helps discover common physical patterns shared by different sets of simulated data. Then, we fine-tune it combining limited observations and adequate simulations. By leveraging the complementary strength of machine learning and domain knowledge, our method has been shown to produce accurate predictions, use less training samples and generalize to out-of-sample scenarios. We further show that the method can provide insights about the variation of physical parameters over space and time in two domain applications: predicting temperature in streams and predicting temperature in lakes.
引用
收藏
页码:3223 / 3250
页数:28
相关论文
共 67 条
[1]  
Anderson Brandon, 2019, NEURIPS
[2]  
[Anonymous], 2017, ARXIV
[3]  
[Anonymous], GRIDMET
[4]   Active learning via collective inference in network regression problems [J].
Appice, Annalisa ;
Loglisci, Corrado ;
Malerba, Donato .
INFORMATION SCIENCES, 2018, 460 :293-317
[5]  
Bao TS, 2022, PR MACH LEARN RES, V180, P118
[6]   Partial Differential Equation Driven Dynamic Graph Networks for Predicting Stream Water Temperature [J].
Bao, Tianshu ;
Jia, Xiaowei ;
Zwart, Jacob ;
Sadler, Jeffrey ;
Appling, Alison ;
Oliver, Samantha ;
Johnson, Taylor T. .
2021 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2021), 2021, :11-20
[7]   Effect of groundwater springs on NO3 concentrations during summer in Catskill Mountain streams [J].
Burns, DA ;
Murdoch, PS ;
Lawrence, GB ;
Michel, RL .
WATER RESOURCES RESEARCH, 1998, 34 (08) :1987-1996
[8]   Big data:: The next Google [J].
Graham-Rowe, Duncan ;
Buxton, Bill ;
Hayward, Vincent ;
Pearson, Ian ;
Karkkainen, Leo ;
Greiner, Helen ;
Dyson, Esther ;
Ito, Joi ;
Chung, Anshe ;
Kelly, Kevin ;
Schillace, Sam .
NATURE, 2008, 455 (7209) :8-U9
[9]   Reconstructing High-resolution Turbulent Flows Using Physics-Guided Neural Networks [J].
Chen, Shengyu ;
Sammak, Shervin ;
Givi, Peyman ;
Yurko, Joseph P. ;
Jia, Xiaowei .
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, :1369-1379
[10]   The impact of new land surface physics on the GCM simulation of climate and climate sensitivity [J].
Cox, PM ;
Betts, RA ;
Bunton, CB ;
Essery, RLH ;
Rowntree, PR ;
Smith, J .
CLIMATE DYNAMICS, 1999, 15 (03) :183-203