Formation lithology classification using scalable gradient boosted decision trees

被引:148
|
作者
Dev, Vikrant A. [1 ]
Eden, Mario R. [1 ]
机构
[1] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
关键词
Lithology classification; Scalable gradient boosting; XGBoost; LightGBM; Cat Boost; MACHINE LEARNING-METHODS; PREDICTION;
D O I
10.1016/j.compchemeng.2019.06.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The classification of underground formation lithology is an important task in petroleum exploration and engineering since it forms the basis of geological research studies and reservoir parameter calculations. Hence, there have recently been increased efforts to automate lithology classification by incorporating various data science tools and principles. In this regard, efforts were made recently to evaluate machine learning methods to classify formation lithology by using data from the Daniudui gas field (DGF) and Hangjinqi gas field (HGF), both located in China. Although the boosted ensemble learners utilized in the studies performed well, there is still scope for improvement with respect to the prediction metrics. Additionally, the issue of scalability of some of these algorithms is also of concern. Hence, building upon the success of these algorithms in the previous studies, we tap into the state of the art of scalable ensemble decision tree algorithms, in our study. Specifically, we applied recently developed gradient boosted decision tree (GBDT) systems, namely, XGBoost, LightGBM and CatBoost, after combining well log data obtained from DGF and HGF. We compare their performance with random forests (RFs), AdaBoost and gradient boosting machines (GBMs) which serve as a baseline. We evaluated the algorithms using metrics such as the micro average, macro average and weighted average of precision (Pr), recall (Re) and F1-score (F1) on the test set after hyperparameter tuning. In our analysis, among the applied algorithms, we found that LightGBM possessed the highest metrics. Our work identifies LightGBM and CatBoost as good first-choice algorithms for the supervised classification of lithology when utilizing well log data. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:392 / 404
页数:13
相关论文
共 50 条
  • [21] Using Automatic Programming to Improve Gradient Boosting for Classification
    Olsson, Roland
    Acharya, Shubodha
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2022, PT I, 2023, 13588 : 242 - 253
  • [22] A Comparative Study between Frequency Ratio Model and Gradient Boosted Decision Trees with Greedy Dimensionality Reduction in Groundwater Potential Assessment
    Sachdeva, Shruti
    Kumar, Bijendra
    WATER RESOURCES MANAGEMENT, 2020, 34 (15) : 4593 - 4615
  • [23] Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees
    Brophy, Jonathan
    Lowd, Daniel
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [24] CLASSIFICATION AND PREDICTION BY DECISION TREES AND NEURAL NETWORKS
    Prochazka, Michal
    Kouril, Lukas
    Zelinka, Ivan
    MENDELL 2009, 2009, : 177 - 181
  • [25] Decision Trees and Forests in Classification of Credit Institutions
    E. P. Akishina
    V. V. Ivanov
    A. S. Prikazchikova
    Physics of Particles and Nuclei Letters, 2025, 22 (1) : 69 - 76
  • [26] Estimating the Hessian Matrix of Ranking Objectives for Stochastic Learning to Rank with Gradient Boosted Trees
    Kang, Jingwei
    de Rijke, Maarten
    Oosterhuis, Harrie
    PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2390 - 2394
  • [27] Flood-type classification in mountainous catchments using crisp and fuzzy decision trees
    Sikorska, Anna E.
    Viviroli, Daniel
    Seibert, Jan
    WATER RESOURCES RESEARCH, 2015, 51 (10) : 7959 - 7976
  • [28] Using interpretable gradient-boosted decision-tree ensembles to uncover novel dynamical relationships governing monsoon low-pressure systems
    Hunt, Kieran M. R.
    Turner, Andrew G.
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2024, 150 (758) : 1 - 24
  • [29] Big Data Analytics Framework Using Squirrel Search Optimized Gradient Boosted Decision Tree for Heart Disease Diagnosis
    Shaik, Kareemulla
    Ramesh, Janjhyam Venkata Naga
    Mahdal, Miroslav
    Rahman, Mohammad Zia Ur
    Khasim, Syed
    Kalita, Kanak
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [30] Effectiveness of Integrating Ensemble-Based Feature Selection and Novel Gradient Boosted Trees in Runoff Prediction: A Case Study in Vu Gia Thu Bon River Basin, Vietnam
    Aiyelokun, Oluwatobi
    Pham, Quoc Bao
    Aiyelokun, Oluwafunbi
    Linh, Nguyen Thi Thuy
    Roy, Tirthankar
    Anh, Duong Tran
    Lupikasza, Ewa
    PURE AND APPLIED GEOPHYSICS, 2024, 181 (05) : 1725 - 1744