Machine learning approach in predicting water saturation using well data at "TM" Niger Delta

被引:0
作者
Adeogun, Oluwakemi Y. [1 ]
Abdulwaheed, Mukthar O. [1 ]
Adeoti, Lukumon [1 ]
Allo, Olawale J. [1 ]
Fasakin, Olawunmi O. [1 ]
Okunowo, Oluwafemi O. [1 ]
机构
[1] Univ Lagos, Dept Geosci, Akoka, Lagos, Nigeria
关键词
Water saturation; Machine learning; XGBoost; CatBoost; Gradient boosting; Niger Delta; Well logs; Reservoir characterization;
D O I
10.1016/j.sciaf.2025.e02596
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate estimation of water saturation (Sw) is critical for hydrocarbon exploration reservoir management, as it reveals the proportion of pore spaces filled with hydrocarbons and water. Determining Sw could be challenging in the absence of core data or resistivity logs. This informed the use of machine learning (ML) techniques to predict Sw in the "TM" Field, Niger Delta, where missing resistivity log data poses a challenge. Five ML models (XGBoost, AdaBoost, CatBoost, LightGBM, and Gradient Boost) were deployed using well log data (caliper, gamma-ray, neutron, porosity, density, and shale volume) to estimate Sw at "TM" Field, Niger-Delta. The dataset includes 61,253 observations, which were split into training (70%) and testing (30%) sets. After preprocessing and correcting inconsistencies in the data, the five ML models were trained and hyperparameters tuned to optimize performance. The models were evaluated using standard statistical metrics: Root Mean Square Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R2). To validate the performance of these models, predicted Sw values were compared with those estimated from resistivity logs. Likewise the predicted Sw from the five ML models were plotted against the Sw estimated from the resistivity data not used in the prediction process to validate and determine the quality of the predicted Sw from the ML models. Among the five ML models tested, XGBoost exhibited the best performance, with the highest R2 value of 0.9992 and the lowest RMSE of 0.0071. Other models, such as CatBoost, LightGBM, and Gradient Boost, showed strong performance with correlation coefficients of 0.9785, 0.9732, and 0.9299, respectively, but were less accurate than XGBoost. AdaBoost, on the other hand, demonstrated the poorest performance with a correlation coefficient of 0.4381 and the highest RMSE of 0.2082. The cross plot of the predicted Sw from XGBoost's model and actual Sw from Archie's equation had the highest correlation coefficient of 0.9, providing quality prediction thereby aligning with the statistical metrics. Hence, this study has identified Xgboost to be a promising ML tool that could be used to efficiently predict Sw without the use of resistivity data at "TM" Field Niger Delta and this could be applied in other similar geological settings.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Predicting and Analyzing Water Quality using Machine Learning: A Comprehensive Model
    Khan, Yafra
    See, Chai Soo
    2016 IEEE LONG ISLAND SYSTEMS, APPLICATIONS AND TECHNOLOGY CONFERENCE (LISAT), 2016,
  • [42] Predicting Affective States Using Wearable Technology: A Machine Learning Approach
    Biri, Gergely
    Birkenmaier, Dennis
    Schroth, Marc
    Hu, Tao
    Hoffmann, Maximilian
    Giurgiu, Marco
    Woll, Simon
    Stork, Wilhelm
    2024 IEEE INTERNATIONAL WORKSHOP ON SPORT, TECHNOLOGY AND RESEARCH, STAR 2024, 2024, : 199 - 204
  • [43] Predicting the body weight of Balochi sheep using a machine learning approach
    Huma, Zil E.
    Iqbal, Farhat
    TURKISH JOURNAL OF VETERINARY & ANIMAL SCIENCES, 2019, 43 (04) : 500 - 506
  • [44] Predicting osteoarthritis in adults using statistical data mining and machine learning
    Bertoncelli, Carlo M.
    Altamura, Paola
    Bagui, Sikha
    Bagui, Subhash
    Vieira, Edgar Ramos
    Costantini, Stefania
    Monticone, Marco
    Solla, Federico
    Bertoncelli, Domenico
    THERAPEUTIC ADVANCES IN MUSCULOSKELETAL DISEASE, 2022, 14
  • [45] Predicting video virality and viewer engagement: a biometric data and machine learning approach
    Bacic, Dinko
    Gilstrap, Curt
    BEHAVIOUR & INFORMATION TECHNOLOGY, 2024, 43 (12) : 2854 - 2880
  • [46] Predicting patient outcomes in psychiatric hospitals with routine data: a machine learning approach
    Wolff, J.
    Gary, A.
    Jung, D.
    Normann, C.
    Kaier, K.
    Binder, H.
    Domschke, K.
    Klimke, A.
    Franz, M.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (01)
  • [47] A data-driven approach to predicting diabetes and cardiovascular disease with machine learning
    Dinh, An
    Miertschin, Stacey
    Young, Amber
    Mohanty, Somya D.
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (01)
  • [48] Predicting subnational GDP in Vietnam with remote sensing data: a machine learning approach
    Suleiman, Hussein
    Nguyen, Minh-Thu Thi
    Mendez, Carlos
    LETTERS IN SPATIAL AND RESOURCE SCIENCES, 2025, 18 (01)
  • [49] Predicting patient outcomes in psychiatric hospitals with routine data: a machine learning approach
    J. Wolff
    A. Gary
    D. Jung
    C. Normann
    K. Kaier
    H. Binder
    K. Domschke
    A. Klimke
    M. Franz
    BMC Medical Informatics and Decision Making, 20
  • [50] Data Balancing Techniques for Predicting Student Dropout Using Machine Learning
    Mduma, Neema
    DATA, 2023, 8 (03)