Formation lithology classification using scalable gradient boosted decision trees

被引:148
|
作者
Dev, Vikrant A. [1 ]
Eden, Mario R. [1 ]
机构
[1] Auburn Univ, Dept Chem Engn, Auburn, AL 36849 USA
关键词
Lithology classification; Scalable gradient boosting; XGBoost; LightGBM; Cat Boost; MACHINE LEARNING-METHODS; PREDICTION;
D O I
10.1016/j.compchemeng.2019.06.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The classification of underground formation lithology is an important task in petroleum exploration and engineering since it forms the basis of geological research studies and reservoir parameter calculations. Hence, there have recently been increased efforts to automate lithology classification by incorporating various data science tools and principles. In this regard, efforts were made recently to evaluate machine learning methods to classify formation lithology by using data from the Daniudui gas field (DGF) and Hangjinqi gas field (HGF), both located in China. Although the boosted ensemble learners utilized in the studies performed well, there is still scope for improvement with respect to the prediction metrics. Additionally, the issue of scalability of some of these algorithms is also of concern. Hence, building upon the success of these algorithms in the previous studies, we tap into the state of the art of scalable ensemble decision tree algorithms, in our study. Specifically, we applied recently developed gradient boosted decision tree (GBDT) systems, namely, XGBoost, LightGBM and CatBoost, after combining well log data obtained from DGF and HGF. We compare their performance with random forests (RFs), AdaBoost and gradient boosting machines (GBMs) which serve as a baseline. We evaluated the algorithms using metrics such as the micro average, macro average and weighted average of precision (Pr), recall (Re) and F1-score (F1) on the test set after hyperparameter tuning. In our analysis, among the applied algorithms, we found that LightGBM possessed the highest metrics. Our work identifies LightGBM and CatBoost as good first-choice algorithms for the supervised classification of lithology when utilizing well log data. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:392 / 404
页数:13
相关论文
共 50 条
  • [31] Beyond model interpretability using LDA and decision trees for α-amylase and α-glucosidase inhibitor classification studies
    Dieguez-Santana, Karel
    Rivera-Borroto, Oscar M.
    Puris, Amilkar
    Hai Pham-The
    Huong Le-Thi-Thu
    Rasulev, Bakhtiyor
    Casanola-Martin, Gerardo M.
    CHEMICAL BIOLOGY & DRUG DESIGN, 2019, 94 (01) : 1414 - 1421
  • [32] Age Classification of Rice Seeds in Japan Using Gradient-Boosting and ANFIS Algorithms
    Rathnayake, Namal
    Miyazaki, Akira
    Dang, Tuan Linh
    Hoshino, Yukinobu
    SENSORS, 2023, 23 (05)
  • [33] Boosted decision trees approach to neck alpha events discrimination in DEAP-3600 experiment
    Grobov, A.
    Ilyasov, A.
    PHYSICA SCRIPTA, 2020, 95 (07)
  • [34] Comparing several approaches for hierarchical classification of proteins with decision trees
    Costa, Eduardo P.
    Lorena, Ana C.
    Carvalho, Andre C. P. L. F.
    Freitas, Alex A.
    Holden, Nicholas
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2007, 4643 : 126 - +
  • [35] Evaluation of classification and decision trees in predicting daily precipitation occurrences
    Samadianfard, S.
    Mikaeili, F.
    Prasad, R.
    WATER SUPPLY, 2022, 22 (04) : 3879 - 3895
  • [36] An Efficacy of Spectral Features with Boosted Decision Tree Algorithm for Automatic Heart Sound Classification
    Arora, Vinay
    Leekha, Rohan Singh
    Chana, Inderveer
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2021, 11 (02) : 513 - 528
  • [37] Automated formatting verification technique of paperwork based on the gradient boosting on decision trees
    Nasyrov, Nail
    Komarov, Mikhail
    Tartynskikh, Petr
    Gorlushkina, Nataliya
    9TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE, YSC2020, 2020, 178 : 365 - 374
  • [38] Automatic detection of abnormal EEG signals using wavelet feature extraction and gradient boosting decision tree
    Albaqami, Hezam
    Hassan, Ghulam Mubashar
    Subasi, Abdulhamit
    Datta, Amitava
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2021, 70 (70)
  • [39] Detecting Transmembrane Proteins Using Decision Trees
    Nikravan, Mohammad Hossein
    Kumar, Ashwani
    Zilles, Sandra
    DISCOVERY SCIENCE, DS 2015, 2015, 9356 : 146 - 160
  • [40] Classification trees for decision making in the social services with application to welfare recidivism
    Roback, PJ
    Welch, SM
    JOURNAL OF SOCIAL SERVICE RESEARCH, 2001, 27 (03) : 23 - 40