LandBench 1.0: A benchmark dataset and evaluation metrics for data-driven land surface variables prediction

被引：9

作者：

Li, Qingliang ^{[1
]}

Zhang, Cheng ^{[1
]}

Wei, Shangguan ^{[2
,3
]}

Wei, Zhongwang ^{[2
,3
]}

Yuan, Hua ^{[2
,3
]}

Zhu, Jinlong ^{[1
]}

Li, Xiaoning ^{[1
,4
]}

Li, Lu ^{[2
,3
]}

Li, Gan ^{[1
]}

Liu, Pingping ^{[1
,4
]}

Dai, Yongjiu ^{[2
,3
]}

机构：

[1] Changchun Normal Univ, Coll Comp Sci & Technol, Changchun 130032, Peoples R China

[2] Sun Yat Sen Univ, Sch Atmospher Sci, Southern Marine Sci & Engn Guangdong Lab Zhuhai, Guangdong Prov Key Lab Climate Change, Guangzhou, Guangdong, Peoples R China

[3] Sun Yat Sen Univ, Sch Atmospher Sci, Guangdong Prov Key Lab Climate Change & Nat Disast, Guangzhou, Guangdong, Peoples R China

[4] Jilin Univ, Coll Comp Sci & Technol, Changchun 130032, Peoples R China

来源：

EXPERT SYSTEMS WITH APPLICATIONS | 2024年 / 243卷

基金：

中国国家自然科学基金;

关键词：

Land surface variables; Benchmark dataset; Deep learning; Soil moisture; SOIL-MOISTURE; NEURAL-NETWORK; RUNOFF; SATELLITE; PRODUCTS; EXTREMES;

D O I：

10.1016/j.eswa.2023.122917

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The advancements in deep learning methods have presented new opportunities and challenges for predicting land surface variables (LSVs) due to their similarity with computer sciences tasks. However, few researchers focus on the benchmark datasets for LSVs predictions that hampers fair comparisons of different data-driven deep learning models. Hence, we propose a LSVs benchmark dataset and prediction toolbox to boost research in datadriven LSVs modeling and improve the consistency of data-driven deep learning models for LSVs. LSVs benchmark dataset contains a large number of hydrology-related variables, such as global soil moisture, runoff, etc., which can verify the simulation of hydrological processes. Various global data from European Centre for Medium-Range Weather Forecasts reanalysis 5 (ERA5), ERA5-land, global gridded soil information (SoilGrid), soil moisture storage capacity (SMSC), and moderate-resolution imaging spectroradiometer (MODIS) datasets have been pre-processed into daily data at 0.5-, 1-, 2-, and 4-degree resolutions to facilitate their use in datadriven models. Simple statistical metrics, i.e., the root mean squared error and correlation coefficient, are chosen to evaluate the performance of different deep learning (DL) models, including convolutional neural network, long short-term memory and convolution long short-term memory models, with lead times of 1 and 5 days. A processed-based model serves as a physic baseline, soil moisture and surface sensible heat fluxes are taken as the target variables. The developed benchmark dataset and evaluation metrics for predicting LSVs using data-driven approaches, named as the LandBench toolbox, were implemented using Pytorch. This toolbox facilitates the reimplementation of existing methods, the development of novel predictive models, and the utilization of unified evaluation metrics. Additionally, the toolbox incorporates address mapping technology to enable high-resolution global predictions with constrained computing resources. We hope LandBench will not only serves as a standardized framework, fostering equitable model comparisons, but also provides indispensable data and a robust scientific foundation essential for advancing climate change research, disaster management, and sustainable development initiatives.

引用

页数：17

共 79 条

[1]

Anderson S., 2021, Rivers and Lakes/ Modelling approaches, DOI DOI 10.5194/HESS-2021-113

[2] Evaluation of 18 satellite- and model-based soil moisture products using in situ measurements from 826 sensors [J].

Beck, Hylke E. ;

Pan, Ming ;

Miralles, Diego G. ;

Reichle, Rolf H. ;

Dorigo, Wouter A. ;

Hahn, Sebastian ;

Sheffield, Justin ;

Karthikeyan, Lanka ;

Balsamo, Gianpaolo ;

Parinussa, Robert M. ;

van Dijk, Albert I. J. M. ;

Du, Jinyang ;

Kimball, John S. ;

Vergopolan, Noemi ;

Wood, Eric F. .

HYDROLOGY AND EARTH SYSTEM SCIENCES, 2021, 25 (01) :17-40

[3] Deep Learned Process Parameterizations Provide Better Representations of Turbulent Heat Fluxes in Hydrologic Models [J].

Bennett, Andrew ;

Nijssen, Bart .

WATER RESOURCES RESEARCH, 2021, 57 (05)

[4] AQ-Bench: a benchmark dataset for machine learning on global air quality metrics [J].

Betancourt, Clara ;

Stomberg, Timo ;

Roscher, Ribana ;

Schultz, Martin G. ;

Stadtler, Scarlet .

EARTH SYSTEM SCIENCE DATA, 2021, 13 (06) :3013-3033

[5]

Cao B., 2020, Frozen ground/Frozen Ground, DOI [10.5194/tc-2020-76, DOI 10.5194/TC-2020-76]

[6] A hybrid deep learning framework with physical process description for simulation of evapotranspiration [J].

Chen, Han ;

Huang, Jinhui Jeanne ;

Dash, Sonam Sandeep ;

Wei, Yizhao ;

Li, Han .

JOURNAL OF HYDROLOGY, 2022, 606

[7] An Empirical Study of Training Self-Supervised Vision Transformers [J].

Chen, Xinlei ;

Xie, Saining ;

He, Kaiming .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9620-9629

[8] SIMULATION OF THE IMPACTS OF CLIMATE-CHANGE ON RUNOFF AND SOIL-MOISTURE IN AUSTRALIAN CATCHMENTS [J].

CHIEW, FHS ;

WHETTON, PH ;

MCMAHON, TA ;

PITTOCK, AB .

JOURNAL OF HYDROLOGY, 1995, 167 (1-4) :121-147

[9]

Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274

[10] GSWP-2 - Multimodel anlysis and implications for our perception of the land surface [J].

Dirmeyer, Paul A. ;

Gao, Xiang ;

Zhao, Mei ;

Guo, Zhichang ;

Oki, Taikan ;

Hanasaki, Naota .

BULLETIN OF THE AMERICAN METEOROLOGICAL SOCIETY, 2006, 87 (10) :1381-+

← 1 2 3 4 5 6 7 8 →