Machine learning approach in predicting water saturation using well data at "TM" Niger Delta

被引:0
作者
Adeogun, Oluwakemi Y. [1 ]
Abdulwaheed, Mukthar O. [1 ]
Adeoti, Lukumon [1 ]
Allo, Olawale J. [1 ]
Fasakin, Olawunmi O. [1 ]
Okunowo, Oluwafemi O. [1 ]
机构
[1] Univ Lagos, Dept Geosci, Akoka, Lagos, Nigeria
关键词
Water saturation; Machine learning; XGBoost; CatBoost; Gradient boosting; Niger Delta; Well logs; Reservoir characterization;
D O I
10.1016/j.sciaf.2025.e02596
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Accurate estimation of water saturation (Sw) is critical for hydrocarbon exploration reservoir management, as it reveals the proportion of pore spaces filled with hydrocarbons and water. Determining Sw could be challenging in the absence of core data or resistivity logs. This informed the use of machine learning (ML) techniques to predict Sw in the "TM" Field, Niger Delta, where missing resistivity log data poses a challenge. Five ML models (XGBoost, AdaBoost, CatBoost, LightGBM, and Gradient Boost) were deployed using well log data (caliper, gamma-ray, neutron, porosity, density, and shale volume) to estimate Sw at "TM" Field, Niger-Delta. The dataset includes 61,253 observations, which were split into training (70%) and testing (30%) sets. After preprocessing and correcting inconsistencies in the data, the five ML models were trained and hyperparameters tuned to optimize performance. The models were evaluated using standard statistical metrics: Root Mean Square Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R2). To validate the performance of these models, predicted Sw values were compared with those estimated from resistivity logs. Likewise the predicted Sw from the five ML models were plotted against the Sw estimated from the resistivity data not used in the prediction process to validate and determine the quality of the predicted Sw from the ML models. Among the five ML models tested, XGBoost exhibited the best performance, with the highest R2 value of 0.9992 and the lowest RMSE of 0.0071. Other models, such as CatBoost, LightGBM, and Gradient Boost, showed strong performance with correlation coefficients of 0.9785, 0.9732, and 0.9299, respectively, but were less accurate than XGBoost. AdaBoost, on the other hand, demonstrated the poorest performance with a correlation coefficient of 0.4381 and the highest RMSE of 0.2082. The cross plot of the predicted Sw from XGBoost's model and actual Sw from Archie's equation had the highest correlation coefficient of 0.9, providing quality prediction thereby aligning with the statistical metrics. Hence, this study has identified Xgboost to be a promising ML tool that could be used to efficiently predict Sw without the use of resistivity data at "TM" Field Niger Delta and this could be applied in other similar geological settings.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Predicting How Well Adolescents Get Along with Peers and Teachers: A Machine Learning Approach
    Ali, Farhan
    Ang, Rebecca P.
    JOURNAL OF YOUTH AND ADOLESCENCE, 2022, 51 (07) : 1241 - 1256
  • [32] Predicting How Well Adolescents Get Along with Peers and Teachers: A Machine Learning Approach
    Farhan Ali
    Rebecca P. Ang
    Journal of Youth and Adolescence, 2022, 51 : 1241 - 1256
  • [33] Predicting data science performance from log data: using machine learning
    Doleck, Tenzin
    Agand, Pedram
    Pirrotta, Dylan
    EDUCATION AND INFORMATION TECHNOLOGIES, 2025,
  • [34] Machine learning approach for data analysis and predicting coronavirus using COVID-19 India dataset
    Singh S.
    Ramkumar K.R.
    Kukkar A.
    International Journal of Business Intelligence and Data Mining, 2023, 24 (01) : 47 - 73
  • [35] Predicting failure pressure of corroded gas pipelines: A data-driven approach using machine learning
    Xiao, Rui
    Zayed, Tarek
    Meguid, Mohamed A.
    Sushama, Laxmi
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2024, 184 : 1424 - 1441
  • [36] Predicting Severity of Head Collision Events in Elite Soccer Using Preinjury Data: A Machine Learning Approach
    Tarzi, Gabriel
    Tarzi, Christopher
    Saha, Ashirbani
    Cusimano, Michael D.
    CLINICAL JOURNAL OF SPORT MEDICINE, 2023, 33 (02): : 165 - 171
  • [37] Predicting the likelihood of readmission in patients with ischemic stroke: An explainable machine learning approach using common data model data
    Hwang, Yu Seong
    Kim, Seongheon
    Yim, Inhyeok
    Park, Yukyoung
    Kang, Seonguk
    Jo, Heui Sug
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 195
  • [38] Predicting Childhood Obesity Based on Single and Multiple Well-Child Visit Data Using Machine Learning Classifiers
    Mondal, Pritom Kumar
    Foysal, Kamrul H. H.
    Norman, Bryan A. A.
    Gittner, Lisaann S. S.
    SENSORS, 2023, 23 (02)
  • [39] Predicting Water Quality Parameters in Lake Pontchartrain using Machine Learning
    Daniels, Alexis
    Koutsougeras, Cris
    5TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND DATA MINING (ICISDM 2021), 2021, : 28 - 33
  • [40] Machine learning approach to predicting the hysteresis of water retention curves of porous media
    Beriozkin, Arcady
    Anidjar, Or Haim
    Azaria, Amos
    Hazon, Noam
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237