Combination of Machine Learning and Kriging for Spatial Estimation of Geological Attributes

被引:35
作者
Erdogan Erten, Gamze [1 ]
Yavuz, Mahmut [1 ]
Deutsch, Clayton V. [2 ]
机构
[1] Eskisehir Osmangazi Univ, Dept Min Engn, TR-26040 Eskisehir, Turkey
[2] Univ Alberta, Ctr Computat Geostat Donadeo Innovat Ctr Engn 6 2, 9211-116 St, Edmonton, AB T6G 1H9, Canada
关键词
Spatial estimation; Kriging; Machine learning; Super learner; Combination; Sequential quadratic programming; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; CROSS-VALIDATION; CLASSIFICATION; PREDICTION; ALGORITHMS; REGRESSION; RESERVOIR;
D O I
10.1007/s11053-021-10003-w
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
A growing number of studies in the spatial estimation of geological features use machine learning (ML) models, as these models promise to provide efficient solutions for estimation especially in non-Gaussian, non-stationary and complex cases. However, these models have two major limitations: (1) the data are considered to be independent and identically distributed (or spatially uncorrelated), and (2) the data are not reproduced at their locations. Kriging, on the other hand, has a long history of generating unbiased estimates with minimum error variance at unsampled locations. Kriging assumes stationarity and linearity. This study proposes a methodology that combines kriging and ML models to mitigate the disadvantages of each method and obtain more accurate estimates. In the proposed methodology, a stacked ensemble model, which is also referred to as the super learner (SL) model, is applied for ML modeling. We have shown how the estimates generated by the SL model and estimates obtained from kriging can be combined through a weighting function based on a kriging variance. The weights are optimized using the sequential quadratic programming. The proposed methodology is demonstrated in two synthetic case studies containing data with non-stationarity and non-Gaussian features; a real case study using a dataset from an oilsands deposit is also presented. The performance of the combined model is compared with the SL model and kriging using the coefficient of determination (R-squared), root mean squared error, and mean absolute error criteria. The combined model appears to yield more accurate estimates than the ones generated by SL model and kriging in all cases.
引用
收藏
页码:191 / 213
页数:23
相关论文
共 75 条
[11]   Sequential quadratic programming for large-scale nonlinear optimization [J].
Boggs, PT ;
Tolle, JW .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2000, 124 (1-2) :123-137
[12]  
Breiman L, 1996, MACH LEARN, V24, P49
[13]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[14]   Evaluation of machine learning methods for lithology classification using geophysical data [J].
Bressan, Thiago Santi ;
de Souza, Marcelo Kehl ;
Girelli, Tiago J. ;
Chemale Junior, Farid .
COMPUTERS & GEOSCIENCES, 2020, 139
[15]  
Brownlee J., 2016, Machine Learning Mastery
[16]  
Chatterjee S., 2011, NAT RESOUR RES, V20, P117, DOI [10.1007/s11053-011-9140-6, DOI 10.1007/S11053-011-9140-6]
[17]   Ore Grade Prediction Using a Genetic Algorithm and Clustering Based Ensemble Neural Network Model [J].
Chatterjee, Snehamoy ;
Bandopadhyay, Sukumar ;
Machuca, David .
MATHEMATICAL GEOSCIENCES, 2010, 42 (03) :309-326
[18]  
Chiles Jean-Paul, 2009, Geostatistics: modeling spatial uncertainty, V497
[19]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[20]   NEAREST NEIGHBOR PATTERN CLASSIFICATION [J].
COVER, TM ;
HART, PE .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1967, 13 (01) :21-+