Spatial bagging for predictive machine learning uncertainty quantification

被引:0
作者
Ozbayrak, Fehmi [1 ]
Foster, John T. [1 ]
Pyrcz, Michael J. [1 ,2 ]
机构
[1] Univ Texas Austin, Cockrell Sch Engn, Hildebrand Dept Petr & Geosyst Engn, Chem & Petr Engn Bldg,200 E Dean Keeton St, Austin, TX 78712 USA
[2] Univ Texas Austin, Jackson Sch Geosci, Dept Earth & Planetary Sci, 2275 Speedway Stop C9000, Austin, TX 78712 USA
关键词
Spatial bagging; Ensemble learning; Effective sample size; Bootstrap methods; Uncertainty quantification; Uncertainty goodness;
D O I
10.1016/j.cageo.2025.105947
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Uncertainty quantification is a critical component in the interpretation of spatial phenomena, particularly within the geosciences, where incomplete subsurface data leads to various possible scenarios, making it crucial for risk assessment and decision-making. Traditional geostatistical methods have served as the cornerstone for uncertainty analysis; however, the incorporation of machine learning, particularly ensemble methods, offers a compelling augmentation, especially in handling complex and noisy datasets. Building on our previous work, which introduced a spatial bagging technique for enhancing prediction accuracy, this study extends the method to uncertainty quantification by applying a widely-used UQ metric from geostatistics. Our approach employs a bootstrap method adjusted for effective sample size derived from spatial statistics, addressing the common issue of overfitting when dealing with dependent data. We demonstrate, through a series of synthetic datasets with varied noise levels and spatial structures, that our spatial bagging method not only outperforms standard bagging techniques in prediction accuracy but also provides superior uncertainty quantification. The robustness of the method against noise and its computational efficiency, particularly in spatially correlated data, positions it as a promising tool for geoscientists and others who require reliable uncertainty measures in spatial analysis.
引用
收藏
页数:10
相关论文
共 25 条
[1]  
[Anonymous], 1992, Bootstrap methods: another look at the
[2]  
Babak O, 2008, J CAN PETROL TECHNOL, V47, P41
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[5]  
Chernick M.R., 2011, Bootstrap methods: A guide for practitioners and researchers
[6]  
Chiles J-P., 2012, Geostatistics: Modeling Spatial Uncertainty, V713
[7]  
Deutsch C.V., 2004, Cent. Comput. Geostat. Annu. Rep., V6
[8]  
Deutsch CV, 1997, QUANT GEO G, V8, P115
[9]   Ensemble methods in machine learning [J].
Dietterich, TG .
MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 :1-15
[10]  
Efron B., 1993, An introduction to the bootstrap