Super learner approach to predict total organic carbon using stacking machine learning models based on well logs

Cited by: 16
Authors
Goliatt, L. [1]
Saporetti, C. M. [2]
Pereira, E. [3]
Affiliations
[1] Univ Fed Juiz de Fora, Dept Computat & Appl Mech, Juiz De Fora, Brazil
[2] Univ Estado Rio De Janeiro, Polytech Inst, Dept Computat Modeling, Nova Friburgo, Brazil
[3] Univ Estado Rio De Janeiro, Dept Stratig & Paleontol, Rio De Janeiro, Brazil
Keywords
Machine learning; Stacking model; Total organic carbon; Geology; Artificial intelligence; PARS GAS-FIELD; INTELLIGENT SYSTEMS; COMMITTEE MACHINE; SONGLIAO BASIN; NEURAL-NETWORK; SHALE GAS; PERMEABILITY; EXAMPLE;
DOI
10.1016/j.fuel.2023.128682
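Chinese Library Classification (CLC) number
TE [Petroleum and natural gas industry]; TK [Energy and power engineering]
Discipline classification codes
0807; 0820
Abstract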
Determining total organic carbon (TOC) content is essential for risk assessment in oil exploration: TOC is a parameter used to characterize hydrocarbon-generating rocks, since intervals rich in organic matter are a basic requirement for oil and gas accumulation. However, TOC determination can be costly, demanding destructive tests on source-rock samples, expensive laboratory equipment, and specialized personnel. In this context, computational methods are needed to bypass those problems, and machine learning models emerge as an option. Stacking is one approach to integrating machine learning methods that improves performance and, consequently, prediction quality. This paper presents a super learner strategy, based on stacking approaches, as a surrogate model for TOC modeling. The super learner has three levels containing different types of learners (machine learning methods), where two stacking models form the first two levels. The following machine learning models were used to build the super learner: K-Nearest Neighbors (KNN), Linear Regression (LR), Multi-layer Perceptron Neural Network (MLP), Random Forest (RF), Ridge Regression (RR), and Support Vector Regression (SVR). The proposed model was compared with standalone machine learning models and other canonical stacking models. The resulting super learner stacking model attained the best average performance for TOC modeling (R = 0.897, R² = 0.80, RMSE = 1.16, MAE = 0.93, and MAPE = 28.30%). The proposed approach yields an efficient, data-driven alternative model for TOC prediction, resulting in a reliable automated technology to assist oil and gas well management and decision-making.
Pages: 12