A Framework for the Cross-Validation of Categorical Geostatistical Simulations

被引:19
作者
Juda, Przemyslaw [1 ]
Renard, Philippe [1 ]
Straubhaar, Julien [1 ]
机构
[1] Univ Neuchatel, Ctr Hydrogeol & Geotherm, Neuchatel, Switzerland
基金
瑞士国家科学基金会;
关键词
POINT STATISTICAL SIMULATIONS; SELECTION;
D O I
10.1029/2020EA001152
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
The mapping of subsurface parameters and the quantification of spatial uncertainty requires selecting adequate models and their parameters. Cross-validation techniques have been widely used for geostatistical model selection for continuous variables, but the situation is different for categorical variables. In these cases, cross-validation is seldom applied, and there is no clear consensus on which method to employ. Therefore, this paper proposes a systematic framework for the cross-validation of geostatistical simulations of categorical variables such as geological facies. The method is based on K-fold cross-validation combined with a proper scoring rule. It can be applied whenever an observation data set is available. At each cross-validation iteration, the training set becomes conditioning data for the tested geostatistical model, and the ensemble of simulations is compared to true values. The proposed framework is generic. Its application is illustrated with two examples using multiple-point statistics simulations. In the first test case, the aim is to identify a training image from a given data set. In the second test case, the aim is to identify the parameters in a situation including nonstationarity for a coastal alluvial aquifer in the south of France. Cross-validation scores are used as metrics of model performance and quadratic scoring rule, zero-one score, and balanced linear score are compared. The study shows that the proposed fivefold stratified cross-validation with the quadratic scoring rule allows ranking the geostatistical models and helps to identify the proper parameters.
引用
收藏
页数:17
相关论文
共 44 条
[1]   Quantitative evaluation of multiple-point simulations using image segmentation and texture descriptors [J].
Abdollahifard, Mohammad Javad ;
Mariethoz, Gregoire ;
Ghavim, Maryam .
COMPUTATIONAL GEOSCIENCES, 2019, 23 (06) :1349-1368
[2]   Multiple-Point Geostatistical Lithofacies Simulation of Fluvial Sand-Rich Depositional Environment: A Case Study From Zubair Formation/South Rumaila Oil Field [J].
Al-Mudhafar, Watheq J. .
SPE RESERVOIR EVALUATION & ENGINEERING, 2018, 21 (01) :39-53
[3]   Probability Aggregation Methods in Geoscience [J].
Allard, D. ;
Comunian, A. ;
Renard, P. .
MATHEMATICAL GEOSCIENCES, 2012, 44 (05) :545-581
[4]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[5]   MPS-APO: a rapid and automatic parameter optimizer for multiple-point geostatistics [J].
Baninajar, Ehsanollah ;
Sharghi, Yousef ;
Mariethoz, Gregoire .
STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2019, 33 (11-12) :1969-1989
[6]  
Boisvert J., 2007, Nat. Resour. Res, V16, P313, DOI [10.1007/s11053-008-9058-9, DOI 10.1007/S11053-008-9058-9]
[7]   SUBMODEL SELECTION AND EVALUATION IN REGRESSION - THE X-RANDOM CASE [J].
BREIMAN, L ;
SPECTOR, P .
INTERNATIONAL STATISTICAL REVIEW, 1992, 60 (03) :291-319
[8]  
Breiman L., 2017, Classification and regression trees (the wadsworth statistics/probability series) chapman and hall, DOI 10.1201/9781315139470/CLASSIFICATION-REGRESSION-TREES-LEO-BREIMAN-JEROME-FRIEDMAN-RICHARD-OLSHEN-CHARLES-STONE
[9]  
Brier G. W., 1950, Monthly weather review, V78, P1, DOI [DOI 10.1175/1520-0493(1950)078LT
[10]  
0001:VOFEITGT