A Comparative Study of Three Supervised Machine-Learning Algorithms for Classifying Carbonate Vuggy Facies in the Kansas Arbuckle Formation

被引:20
作者
Deng, Tianqi [1 ]
Xu, Chicheng [2 ]
Jobe, Dawn
Xu, Rui [1 ]
机构
[1] Univ Texas Austin, Hildebrand Dept Petr & Geosyst Engn, Austin, TX 78712 USA
[2] Aramco Serv Co, Aramco Res Ctr Houston, 16300 Pk Row, Houston, TX 77084 USA
来源
PETROPHYSICS | 2019年 / 60卷 / 06期
关键词
ROCK CLASSIFICATION; PREDICTION; SHALE; LOGS;
D O I
10.30632/PJV60N6-2019a8
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Diagenetic features, such as vugs, fractures and dolomite bodies can have significant impacts on carbonate reservoir quality. Challenges remain in characterizing these diagenetic features from well logs, as they are often mixed with changes in mineral and fluid concentrations. In this paper, a data-driven approach is developed to classify vuggy facics bascd on core and well logs from a key well penetrating the Arbuckle formation in Kansas. Three supervised machine-learning methods, namely artificial neural networks (ANN), support vector machines (SVM), and random forests (RF), are compared for their accuracy, stability, and computational efficiency. Hyperparameters are tuned using cross-validation and Bayesian optimization. Different feature selection methods and data labeling schemes arc also evaluated to optimize the prediction. Results indicate predicting a binary classification (vuggy/nonvuggy) presents an-80% accuracy, compared to a 65% accuracy using a five-class vug-size-based classification label. A direct input of well logs as training features is recommended instead of using derived petrophysical properties. Among the three machine-learning algorithms, ANN outperforms the other two methods for vug/nonvug detection, whereas for vug-size classification, RF is the best algorithm to apply. This work also suggests RF shows the least sensitivity to hyperparameters (i.e., maximum number of splits and minimum leaf sizes) according to the response surfaccs constructed via Bayesian optimization. For the dataset used in this study. SVM is the most computationally efficient algorithm.
引用
收藏
页码:838 / 853
页数:16
相关论文
共 33 条
[1]  
[Anonymous], 2011, P 14 INT C ARTIFICIA
[2]  
[Anonymous], 1985, P 26 ANN LOGG S DALL
[3]   A survey of cross-validation procedures for model selection [J].
Arlot, Sylvain ;
Celisse, Alain .
STATISTICS SURVEYS, 2010, 4 :40-79
[4]  
Bishop C. M., 2006, PATTERN RECOGNITION, DOI DOI 10.1117/1.2819119
[5]  
Bize-Forest N., 2018, T SPWLA 59 ANN LOGG
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]  
Clerke E.A., 2014, T SPWLA 55 ANN LOGG
[8]   Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information [J].
Cracknell, Matthew J. ;
Reading, Anya M. .
COMPUTERS & GEOSCIENCES, 2014, 63 :22-33
[9]   New method for quantification of vuggy porosity from digital optical borehole images as applied to the karstic Pleistocene limestone of the Biscayne aquifer, southeastern Florida [J].
Cunningham, KJ ;
Carlson, JI ;
Hurley, NF .
JOURNAL OF APPLIED GEOPHYSICS, 2004, 55 (1-2) :77-90
[10]   Textural and pore size analysis of carbonates from integrated core and nuclear magnetic resonance logging: An Arbuckle study [J].
Doveton, John ;
Watney, Lynn .
INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2015, 3 (01) :SA77-SA89