Formation lithology classification using scalable gradient boosted decision trees

被引:148
|
作者
Dev, Vikrant A. [1 ]
Eden, Mario R. [1 ]
机构
[1] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
关键词
Lithology classification; Scalable gradient boosting; XGBoost; LightGBM; Cat Boost; MACHINE LEARNING-METHODS; PREDICTION;
D O I
10.1016/j.compchemeng.2019.06.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The classification of underground formation lithology is an important task in petroleum exploration and engineering since it forms the basis of geological research studies and reservoir parameter calculations. Hence, there have recently been increased efforts to automate lithology classification by incorporating various data science tools and principles. In this regard, efforts were made recently to evaluate machine learning methods to classify formation lithology by using data from the Daniudui gas field (DGF) and Hangjinqi gas field (HGF), both located in China. Although the boosted ensemble learners utilized in the studies performed well, there is still scope for improvement with respect to the prediction metrics. Additionally, the issue of scalability of some of these algorithms is also of concern. Hence, building upon the success of these algorithms in the previous studies, we tap into the state of the art of scalable ensemble decision tree algorithms, in our study. Specifically, we applied recently developed gradient boosted decision tree (GBDT) systems, namely, XGBoost, LightGBM and CatBoost, after combining well log data obtained from DGF and HGF. We compare their performance with random forests (RFs), AdaBoost and gradient boosting machines (GBMs) which serve as a baseline. We evaluated the algorithms using metrics such as the micro average, macro average and weighted average of precision (Pr), recall (Re) and F1-score (F1) on the test set after hyperparameter tuning. In our analysis, among the applied algorithms, we found that LightGBM possessed the highest metrics. Our work identifies LightGBM and CatBoost as good first-choice algorithms for the supervised classification of lithology when utilizing well log data. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:392 / 404
页数:13
相关论文
共 50 条
  • [1] GRADIENT BOOSTED DECISION TREES FOR LITHOLOGY CLASSIFICATION
    Dev, Vikrant A.
    Eden, Mario R.
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE ON FOUNDATIONS OF COMPUTER-AIDED PROCESS DESIGN, 2019, 47 : 113 - 118
  • [2] Automated system for Brain Tumour Detection and Classification using eXtreme Gradient Boosted Decision Trees
    Mudgal, Tushar Kant
    Jain, Siddhant
    Gupta, Aditya
    Gusain, Kunal
    2017 INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS (ICSOFTCOMP), 2017,
  • [3] Flexible loss functions for binary classification in gradient-boosted decision trees: An application to credit scoring
    Mushava, Jonah
    Murray, Michael
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [4] Classification of Biodegradable Substances Using Balanced Random Trees and Boosted C5.0 Decision Trees
    Elsayad, Alaa M.
    Nassef, Ahmed M.
    Al-Dhaifallah, Mujahed
    Elsayad, Khaled A.
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2020, 17 (24) : 1 - 22
  • [5] Transition-Aware Human Activity Recognition Using eXtreme Gradient Boosted Decision Trees
    Gusain, Kunal
    Gupta, Aditya
    Popli, Bhavya
    ADVANCED COMPUTING AND COMMUNICATION TECHNOLOGIES, 2018, 562 : 41 - 49
  • [6] Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection
    Hancock J.T.
    Khoshgoftaar T.M.
    SN Computer Science, 2021, 2 (4)
  • [7] Verifying the Value and Veracity of eXtreme Gradient Boosted Decision Trees on a Variety of Datasets
    Gupta, Aditya
    Gusain, Kunal
    Popli, Bhavya
    2016 11TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2016, : 457 - 462
  • [8] Predicting the algorithmic time complexity of single parametric algorithms using multiclass classification with gradient boosted trees
    Sharma, Deepak Kumar
    Vohra, Sumit
    Gupta, Tarun
    Goyal, Vipul
    2018 ELEVENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2018, : 6 - 11
  • [9] A Data-Driven Wall-Shear Stress Model for LES Using Gradient Boosted Decision Trees
    Radhakrishnan, Sarath
    Adu Gyamfi, Lawrence
    Miro, Arnau
    Font, Bernat
    Calafell, Joan
    Lehmkuhl, Oriol
    HIGH PERFORMANCE COMPUTING - ISC HIGH PERFORMANCE DIGITAL 2021 INTERNATIONAL WORKSHOPS, 2021, 12761 : 105 - 121
  • [10] Offshore application of landslide susceptibility mapping using gradient-boosted decision trees: a Gulf of Mexico case study
    Dyer, Alec S.
    Mark-Moser, MacKenzie
    Duran, Rodrigo
    Bauer, Jennifer R.
    NATURAL HAZARDS, 2024, 120 (07) : 6223 - 6244