Mapping the probability of ripened subsoils using Bayesian logistic regression with informative priors

被引:10
|
作者
Steinbuch, Luc [1 ,2 ,3 ]
Brus, Dick J. [4 ]
Heuvelink, Gerard B. M. [2 ,3 ]
机构
[1] Wageningen Environm Res Alterra, POB 47, NL-6700 AA Wageningen, Netherlands
[2] Wageningen Univ, Soil Geog & Landscape Grp, POB 47, NL-6700 AA Wageningen, Netherlands
[3] ISRIC World Soil Informat, POB 353, NL-6700 AJ Wageningen, Netherlands
[4] Wageningen Univ, Biometris, POB 16, NL-6700 AA Wageningen, Netherlands
关键词
Bayesian statistics; Binomial logistic regression; Soil ripening; Informative priors; Soil mapping; Soil mapping uncertainty; CLASSIFICATION; PERFORMANCE; AREA; MAP;
D O I
10.1016/j.geoderma.2017.12.010
中图分类号
S15 [土壤学];
学科分类号
0903 ; 090301 ;
摘要
One of the first soil forming processes in marine and fluviatile clay soils is ripening, the irreversible change of physical and chemical soil properties, especially consistency, under influence of air. We used Bayesian binomial logistic regression (BBLR) to update the map showing unripened subsoils for a reclamation area in the west of The Netherlands. Similar to conventional binomial logistic regression (BLR), in BBLR the binary target variable (the subsoil is ripened or unripened) is modelled by a Bernoulli distribution. The logit transform of the probability of success' parameter of the Bernoulli distribution was modelled as a linear combination of the covariates soil type, freeboard (the desired water level in the ditches, compared to surface level) and mean lowest groundwater table. To capture all available information, Bayesian statistics combines legacy data summarized in a 'prior' probability distribution for the regression coefficients with actual observations. Our research focused on quantifying the influence of priors with different information levels, in combination with different sample sizes, on the resulting parameters and maps. We combined subsamples of different size (ranging from 5% to 50% of the original dataset of 676 observations) with priors representing different levels of trust in legacy data and investigated the effect of sample size and prior distribution on map accuracy. The resulting posterior parameter distributions, calculated by Markov chain Monte Carlo simulation, vary in centrality as well as in dispersion, especially for the smaller datasets. More informative priors decreased dispersion and pushed posterior central values towards prior central values. Interestingly, the resulting probability maps were almost similar. However, the associated uncertainty maps were different: a more informative prior decreased prediction uncertainty. When using the 'overall accuracy' validation metric, we found an optimal value for the prior information level, indicating that the standard deviation of the legacy data regression parameters should be multiplied by 10. This effect is only detectable for smaller datasets. The Area Under Curve validation statistic did not provide a meaningful optimal multiplier for the standard deviation. Bayesian binomial logistic regression proved to be a flexible mapping tool but the accuracy gain compared to conventional logistic regression was marginal and may not outweigh the extra modelling and computing effort.
引用
收藏
页码:56 / 69
页数:14
相关论文
共 45 条
  • [31] The Impact of Using Informative Priors in a Bayesian Cost-Effectiveness Analysis: An Application of Endovascular versus Open Surgical Repair for Abdominal Aortic Aneurysms in High-Risk Patients
    McCarron, C. Elizabeth
    Pullenayegum, Eleanor M.
    Thabane, Lehana
    Goeree, Ron
    Tarride, Jean-Eric
    MEDICAL DECISION MAKING, 2013, 33 (03) : 437 - 450
  • [32] Soil erosion hazard mapping using Analytic Hierarchy Process and logistic regression: a case study of Haffouz watershed, central Tunisia
    Kachouri, Sameh
    Achour, Hammadi
    Abida, Habib
    Bouaziz, Samir
    ARABIAN JOURNAL OF GEOSCIENCES, 2015, 8 (06) : 4257 - 4268
  • [33] Debris flow susceptibility mapping in a portion of the Andes and Preandes of San Juan, Argentina using frequency ratio and logistic regression models
    Esper Angillieri, M. Y.
    EARTH SCIENCES RESEARCH JOURNAL, 2013, 17 (02) : 159 - 167
  • [34] Earthquake induced landslide susceptibility mapping using an integrated ensemble frequency ratio and logistic regression models in West Sumatera Province, Indonesia
    Umar, Zahrul
    Pradhan, Biswajeet
    Ahmad, Anuar
    Jebur, Mustafa Neamah
    Tehrany, Mahyat Shafapour
    CATENA, 2014, 118 : 124 - 135
  • [35] LANDSLIDE SUSCEPTIBILITY MAPPING USING LOGISTIC REGRESSION AND FREQUENCY RATIO APPROACHES, CASE STUDY FROM SOUK AHRAS REGION, N E ALGERIA
    Mahadadi, Fatna
    Boumezbeur, Abederrahmane
    ITALIAN JOURNAL OF ENGINEERING GEOLOGY AND ENVIRONMENT, 2020, 20 (01): : 43 - 51
  • [36] Landslide Susceptibility Mapping Using Logistic Regression Analysis along the Jinsha River and Its Tributaries Close to Derong and Deqin County, Southwestern China
    Sun, Xiaohui
    Chen, Jianping
    Bao, Yiding
    Han, Xudong
    Zhan, Jiewei
    Peng, Wei
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2018, 7 (11)
  • [37] GIS-based soil planar slide susceptibility mapping using logistic regression and neural networks: a typical red mudstone area in southwest China
    Zhang, Shuai
    Li, Can
    Peng, Jingyu
    Peng, Dalei
    Xu, Qiang
    Zhang, Qun
    Bate, Bate
    GEOMATICS NATURAL HAZARDS & RISK, 2021, 12 (01) : 852 - 879
  • [38] Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat-Turkey)
    Yilmaz, Isik
    COMPUTERS & GEOSCIENCES, 2009, 35 (06) : 1125 - 1138
  • [39] Landslide hazard mapping in the Constantine city, Northeast Algeria using frequency ratio, weighting factor, logistic regression, weights of evidence, and analytical hierarchy process methods
    Bourenane, Hamid
    Guettouche, Mohamed Said
    Bouhadad, Youcef
    Braham, Massinissa
    ARABIAN JOURNAL OF GEOSCIENCES, 2016, 9 (02) : 1 - 24
  • [40] GIS-based groundwater spring potential mapping in the Sultan Mountains (Konya, Turkey) using frequency ratio, weights of evidence and logistic regression methods and their comparison
    Ozdemir, Adnan
    JOURNAL OF HYDROLOGY, 2011, 411 (3-4) : 290 - 308