Process-Guided Deep Learning Predictions of Lake Water Temperature

被引:243
作者
Read, Jordan S. [1 ]
Jia, Xiaowei [2 ]
Willard, Jared [2 ]
Appling, Alison P. [1 ]
Zwart, Jacob A. [1 ]
Oliver, Samantha K. [1 ]
Karpatne, Anuj [3 ]
Hansen, Gretchen J. A. [4 ]
Hanson, Paul C. [5 ]
Watkins, William [1 ]
Steinbach, Michael [2 ]
Kumar, Vipin [2 ]
机构
[1] US Geol Survey, 959 Natl Ctr, Reston, VA 22092 USA
[2] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN USA
[3] Virginia Tech, Dept Comp Sci, Blacksburg, VA USA
[4] Univ Minnesota, Dept Fisheries Wildlife & Conservat Biol, Minneapolis, MN USA
[5] Univ Wisconsin, Ctr Limnol, Madison, WI 53706 USA
关键词
deep learning; lake modelling; temperature prediction; process-guided deep learning; theory-guided data science; data science; BIG DATA; NEURAL-NETWORK; DATA-DRIVEN; CLIMATE; SIMULATION; QUALITY; MODELS; FUTURE; FRAMEWORK; SCIENCE;
D O I
10.1029/2019WR024922
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The rapid growth of data in water resources has created new opportunities to accelerate knowledge discovery with the use of advanced deep learning tools. Hybrid models that integrate theory with state-of-the art empirical techniques have the potential to improve predictions while remaining true to physical laws. This paper evaluates the Process-Guided Deep Learning (PGDL) hybrid modeling framework with a use-case of predicting depth-specific lake water temperatures. The PGDL model has three primary components: a deep learning model with temporal awareness (long short-term memory recurrence), theory-based feedback (model penalties for violating conversation of energy), and model pretraining to initialize the network with synthetic data (water temperature predictions from a process-based model). In situ water temperatures were used to train the PGDL model, a deep learning (DL) model, and a process-based (PB) model. Model performance was evaluated in various conditions, including when training data were sparse and when predictions were made outside of the range in the training data set. The PGDL model performance (as measured by root-mean-square error (RMSE)) was superior to DL and PB for two detailed study lakes, but only when pretraining data included greater variability than the training period. The PGDL model also performed well when extended to 68 lakes, with a median RMSE of 1.65 degrees C during the test period (DL: 1.78 degrees C, PB: 2.03 degrees C; in a small number of lakes PB or DL models were more accurate). This case-study demonstrates that integrating scientific knowledge into deep learning tools shows promise for improving predictions of many important environmental variables.
引用
收藏
页码:9173 / 9190
页数:18
相关论文
共 86 条
  • [11] COTOJIMENEZ M, 2018, 2018 IEEE INT WORK C, P1
  • [12] Integrating a calibrated groundwater flow model with error-correcting data-driven models to improve predictions
    Demissie, Yonas K.
    Valocchi, Albert J.
    Minsker, Barbara S.
    Bailey, Barbara A.
    [J]. JOURNAL OF HYDROLOGY, 2009, 364 (3-4) : 257 - 271
  • [13] Iterative near-term ecological forecasting: Needs, opportunities, and challenges
    Dietze, Michael C.
    Fox, Andrew
    Beck-Johnson, Lindsay M.
    Betancourt, Julio L.
    Hooten, Mevin B.
    Jarnevich, Catherine S.
    Keitt, Timothy H.
    Kenney, Melissa A.
    Laney, Christine M.
    Larsen, Laurel G.
    Loescher, Henry W.
    Lunch, Claire K.
    Pijanowski, Bryan C.
    Randerson, James T.
    Read, Emily K.
    Tredennick, Andrew T.
    Vargas, Rodrigo
    Weathers, Kathleen C.
    White, Ethan P.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (07) : 1424 - 1432
  • [14] Challenges and design choices for global weather and climate models based on machine learning
    Dueben, Peter D.
    Bauer, Peter
    [J]. GEOSCIENTIFIC MODEL DEVELOPMENT, 2018, 11 (10) : 3999 - 4009
  • [15] Erhan D, 2010, J MACH LEARN RES, V11, P625
  • [16] Prolongation of SMAP to Spatiotemporally Seamless Coverage of Continental US Using a Deep Learning Neural Network
    Fang, Kuai
    Shen, Chaopeng
    Kifer, Daniel
    Yang, Xiao
    [J]. GEOPHYSICAL RESEARCH LETTERS, 2017, 44 (21) : 11030 - 11039
  • [17] An overview of current applications, challenges, and future trends in distributed process-based models in hydrology
    Fatichi, Simone
    Vivoni, Enrique R.
    Ogden, Fred L.
    Ivanov, Valeriy Y.
    Mirus, Benjamin
    Gochis, David
    Downer, Charles W.
    Camporese, Matteo
    Davison, Jason H.
    Ebel, Brian A.
    Jones, Norm
    Kim, Jongho
    Mascaro, Giuseppe
    Niswonger, Richard
    Restrepo, Pedro
    Rigon, Riccardo
    Shen, Chaopeng
    Sulis, Mauro
    Tarboton, David
    [J]. JOURNAL OF HYDROLOGY, 2016, 537 : 45 - 60
  • [18] Metamodeling for Groundwater Age Forecasting in the Lake Michigan Basin
    Fienen, Michael N.
    Nolan, B. Thomas
    Kauffman, Leon J.
    Feinstein, Daniel T.
    [J]. WATER RESOURCES RESEARCH, 2018, 54 (07) : 4750 - 4766
  • [19] Could Machine Learning Break the Convection Parameterization Deadlock?
    Gentine, P.
    Pritchard, M.
    Rasp, S.
    Reinaudi, G.
    Yacalis, G.
    [J]. GEOPHYSICAL RESEARCH LETTERS, 2018, 45 (11) : 5742 - 5751
  • [20] Gers FA, 1999, IEE CONF PUBL, P850, DOI [10.1162/089976600300015015, 10.1049/cp:19991218]