Combination of Machine Learning and Kriging for Spatial Estimation of Geological Attributes

Cited by: 30
Authors
Erdogan Erten, Gamze [1]
Yavuz, Mahmut [1]
Deutsch, Clayton V. [2]
Affiliations
[1] Eskisehir Osmangazi Univ, Dept Min Engn, TR-26040 Eskisehir, Turkey
[2] Univ Alberta, Ctr Computat Geostat Donadeo Innovat Ctr Engn 6 2, 9211-116 St, Edmonton, AB T6G 1H9, Canada
Keywords
Spatial estimation; Kriging; Machine learning; Super learner; Combination; Sequential quadratic programming; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; CROSS-VALIDATION; CLASSIFICATION; PREDICTION; ALGORITHMS; REGRESSION; RESERVOIR
DOI
10.1007/s11053-021-10003-w
Chinese Library Classification (CLC) number
P [Astronomy, Earth Sciences]
Discipline classification code
07
Abstract
A growing number of studies on the spatial estimation of geological features use machine learning (ML) models, which promise efficient solutions for estimation, especially in non-Gaussian, non-stationary, and complex cases. However, these models have two major limitations: (1) the data are assumed to be independent and identically distributed (or spatially uncorrelated), and (2) the data are not reproduced at their locations. Kriging, on the other hand, has a long history of generating unbiased estimates with minimum error variance at unsampled locations, but it assumes stationarity and linearity. This study proposes a methodology that combines kriging and ML models to mitigate the disadvantages of each method and obtain more accurate estimates. In the proposed methodology, a stacked ensemble model, also referred to as the super learner (SL) model, is applied for ML modeling. We show how the estimates generated by the SL model and those obtained from kriging can be combined through a weighting function based on the kriging variance. The weights are optimized using sequential quadratic programming. The proposed methodology is demonstrated in two synthetic case studies containing data with non-stationary and non-Gaussian features; a real case study using a dataset from an oil sands deposit is also presented. The performance of the combined model is compared with that of the SL model and kriging using the coefficient of determination (R-squared), root mean squared error, and mean absolute error. The combined model appears to yield more accurate estimates than those generated by the SL model and kriging in all cases.
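The combination step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the form of the weighting function, so the exponential form `exp(-a * var**b)` used here, the parameter names `a` and `b`, their bounds, and the synthetic data are all assumptions made for illustration and are not the authors' formulation. SciPy's `SLSQP` method is used as one available sequential quadratic programming solver.

```python
import numpy as np
from scipy.optimize import minimize


def kriging_weight(krig_var, a, b):
    # ASSUMED weighting function (not from the paper): weight given to the
    # kriging estimate. It equals 1 where the kriging variance is 0 and
    # decays toward 0 as the variance grows.
    return np.exp(-a * krig_var ** b)


def combine(z_krig, z_sl, krig_var, a, b):
    # Weighted blend of the kriging estimate and the ML (super learner) estimate.
    w = kriging_weight(krig_var, a, b)
    return w * z_krig + (1.0 - w) * z_sl


def fit_weights(z_true, z_krig, z_sl, krig_var):
    # Tune (a, b) by minimizing the mean squared error with SLSQP.
    def objective(params):
        a, b = params
        return np.mean((combine(z_krig, z_sl, krig_var, a, b) - z_true) ** 2)

    result = minimize(objective, x0=[1.0, 1.0], method="SLSQP",
                      bounds=[(0.0, 50.0), (0.1, 5.0)])  # hypothetical bounds
    return result.x


# Synthetic stand-in data (hypothetical): the kriging error grows with the
# kriging variance, while the ML error is the same everywhere.
rng = np.random.default_rng(0)
n = 500
z_true = rng.normal(size=n)
krig_var = rng.uniform(0.0, 1.0, size=n)
z_krig = z_true + rng.normal(size=n) * np.sqrt(krig_var)
z_sl = z_true + 0.5 * rng.normal(size=n)

a_opt, b_opt = fit_weights(z_true, z_krig, z_sl, krig_var)
z_comb = combine(z_krig, z_sl, krig_var, a_opt, b_opt)


def mse(estimate):
    return float(np.mean((estimate - z_true) ** 2))


mse_krig, mse_sl, mse_comb = mse(z_krig), mse(z_sl), mse(z_comb)
print(f"kriging MSE={mse_krig:.3f}  SL MSE={mse_sl:.3f}  combined MSE={mse_comb:.3f}")
```

With this assumed form, the weight is exactly 1 wherever the kriging variance is zero, so the combined estimate equals the kriging estimate at those locations. In practice the parameters would be tuned on held-out or cross-validated estimates rather than on the same values used for evaluation.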
Pages: 191-213
Number of pages: 23