Predicting high resolution total phosphorus concentrations for soils of the Upper Mississippi River Basin using machine learning

被引:4
作者
Dolph, Christine L. [1 ]
Cho, Se Jong [2 ,3 ]
Finlay, Jacques C. [1 ]
Hansen, Amy T. [4 ]
Dalzell, Brent [5 ]
机构
[1] Univ Minnesota, Dept Ecol Evolut & Behav, Gortner Lab 140, 1479 Gortner Ave, St Paul, MN 55108 USA
[2] Univ Maryland, Natl Socio Environm Synth Ctr, Annapolis, MD 21401 USA
[3] US Geol Survey, Water Resources Mission Area, Reston, VA 20192 USA
[4] Univ Kansas, Civil Environm & Architectural Engn Dept, 1530 W 15Th St, Lawrence, KS 66045 USA
[5] US Dept Agr, ARS Soil & Water Management Res Unit, 439 Borlaug Hall,1991 Upper Buford Circle, St Paul, MN 55108 USA
关键词
Soil phosphorus; Modeling; Random forest; Conservation; Data mining; Water quality; NITROGEN; MANAGEMENT; DRAINAGE;
D O I
10.1007/s10533-023-01029-8
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The spatial distribution of soil phosphorus (P) is important to both biogeochemical processes and the management of agricultural landscapes, where it is critical for both crop production and conservation planning. Recent advances in the availability of large environmental datasets together with big data analytical tools like machine learning have created opportunities for evaluating and predicting spatial patterns in complex environmental variables like soil P. Here, we apply a random forest machine learning model to publicly available soil P datasets together with nearly 300 geospatial attributes summarizing aspects of soil type, land cover, land use, topography, nutrient inputs, and climate to predict total soil P at a 100 m grid scale for the Upper Mississippi River Basin (UMRB), USA. The UMRB is one of the most intensively farmed regions in the world and is characterized by widespread water quality degradation arising from P-associated eutrophication. Although potentially complex interacting drivers determine total soil P, the predictive accuracy of our random forest model was relatively high (R2 = 0.58 and RMSE = 129.3 for an independent validation dataset). At the regional scale represented by our model, the variables with the greatest comparative importance for predicting soil P included a combination of soil sample depth, land use/land cover, underlying soil physical and geochemical properties, landscape features (such as slope, elevation and proximity to the stream network), nutrient inputs, and climate-related factors. An important product of this research is a fine-scale (100 m) raster data layer of predicted total soil P values for the UMRB for public use. This dataset can be used to improve conservation planning and modeling efforts to improve water quality in the region.
引用
收藏
页码:289 / 310
页数:22
相关论文
共 70 条
[1]   Relationships between extractable soil phosphorus and phosphorus saturation after long-term fertilizer or manure application [J].
Allen, BL ;
Mallarino, AP .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 2006, 70 (02) :454-463
[2]  
[Anonymous], 2021, GRIDDED SOIL SURVEY
[3]  
[Anonymous], 2021, WAT BOUND DAT HUC07
[4]  
Berhe AA, 2018, ANNU REV EARTH PL SC, V46, P521, DOI [10.1146/annurev-earth-082517-010018, 10.1146/annurev-earth-082517010018]
[5]   Fertilizer, landscape features and climate regulate phosphorus retention and river export in diverse Midwestern watersheds [J].
Boardman, Evelyn ;
Danesh-Yazdi, Mohammad ;
Foufoula-Georgiou, Efi ;
Dolph, Christine L. ;
Finlay, Jacques C. .
BIOGEOCHEMISTRY, 2019, 146 (03) :293-309
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Burt R., 2014, Soil Survey Investigations Report No. 51
[8]   The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation [J].
Chicco, Davide ;
Warrens, Matthijs J. ;
Jurman, Giuseppe .
PEERJ COMPUTER SCIENCE, 2021,
[9]  
Clark B, 2018, MON REV, V70, P106
[10]   The role of phosphorus in the eutrophication of receiving waters: A review [J].
Correll, DL .
JOURNAL OF ENVIRONMENTAL QUALITY, 1998, 27 (02) :261-266