Combination of Machine Learning and Kriging for Spatial Estimation of Geological Attributes

被引:30
作者
Erdogan Erten, Gamze [1 ]
Yavuz, Mahmut [1 ]
Deutsch, Clayton V. [2 ]
机构
[1] Eskisehir Osmangazi Univ, Dept Min Engn, TR-26040 Eskisehir, Turkey
[2] Univ Alberta, Ctr Computat Geostat Donadeo Innovat Ctr Engn 6 2, 9211-116 St, Edmonton, AB T6G 1H9, Canada
关键词
Spatial estimation; Kriging; Machine learning; Super learner; Combination; Sequential quadratic programming; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; CROSS-VALIDATION; CLASSIFICATION; PREDICTION; ALGORITHMS; REGRESSION; RESERVOIR;
D O I
10.1007/s11053-021-10003-w
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
A growing number of studies in the spatial estimation of geological features use machine learning (ML) models, as these models promise to provide efficient solutions for estimation especially in non-Gaussian, non-stationary and complex cases. However, these models have two major limitations: (1) the data are considered to be independent and identically distributed (or spatially uncorrelated), and (2) the data are not reproduced at their locations. Kriging, on the other hand, has a long history of generating unbiased estimates with minimum error variance at unsampled locations. Kriging assumes stationarity and linearity. This study proposes a methodology that combines kriging and ML models to mitigate the disadvantages of each method and obtain more accurate estimates. In the proposed methodology, a stacked ensemble model, which is also referred to as the super learner (SL) model, is applied for ML modeling. We have shown how the estimates generated by the SL model and estimates obtained from kriging can be combined through a weighting function based on a kriging variance. The weights are optimized using the sequential quadratic programming. The proposed methodology is demonstrated in two synthetic case studies containing data with non-stationarity and non-Gaussian features; a real case study using a dataset from an oilsands deposit is also presented. The performance of the combined model is compared with the SL model and kriging using the coefficient of determination (R-squared), root mean squared error, and mean absolute error criteria. The combined model appears to yield more accurate estimates than the ones generated by SL model and kriging in all cases.
引用
收藏
页码:191 / 213
页数:23
相关论文
共 75 条
  • [1] Al-Anazi A., 2010, Natural Resources Research, V19, P125, DOI DOI 10.1007/S11053-010-9118-9
  • [2] Support vector regression for porosity prediction in a heterogeneous reservoir: A comparative study
    Al-Anazi, A. F.
    Gates, I. D.
    [J]. COMPUTERS & GEOSCIENCES, 2010, 36 (12) : 1494 - 1503
  • [3] Alpaydin E, 2014, ADAPT COMPUT MACH LE, P1
  • [4] Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression
    An, Senjian
    Liu, Wanquan
    Venkatesh, Svetha
    [J]. PATTERN RECOGNITION, 2007, 40 (08) : 2154 - 2162
  • [5] [Anonymous], 2009, Neural Networks and Learning Machines
  • [6] The application of median indicator kriging and neural network in modeling mixed population in an iron ore deposit
    Badel, Mehdi
    Angorani, Saeed
    Panahi, Masoud Shariat
    [J]. COMPUTERS & GEOSCIENCES, 2011, 37 (04) : 530 - 540
  • [7] The Effect of Splitting of Raw Data into Training and Test Subsets on the Accuracy of Predicting Spatial Distribution by a Multilayer Perceptron
    Baglaeva, E. M.
    Sergeev, A. P.
    Shichkin, A. V.
    Buevich, A. G.
    [J]. MATHEMATICAL GEOSCIENCES, 2020, 52 (01) : 111 - 121
  • [8] Bembom O, 2007, STAT APPL GENET MOL, V6
  • [9] Boggs P. T., 1995, ACTA NUMER, V4, P1, DOI [DOI 10.1017/S0962492900002518, 10.1017/S0962492900002518]
  • [10] Sequential quadratic programming for large-scale nonlinear optimization
    Boggs, PT
    Tolle, JW
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2000, 124 (1-2) : 123 - 137