Machine Learning Method for TOC Prediction: Taking Wufeng and Longmaxi Shales in the Sichuan Basin, Southwest China as an Example

被引:18
作者
Rong, Jia [1 ,2 ,3 ]
Zheng, Zongyuan [1 ,2 ,3 ]
Luo, Xiaorong [1 ,2 ,3 ]
Li, Chao [1 ,2 ]
Li, Yuping [4 ]
Wei, Xiangfeng [4 ]
Wei, Quanchao [4 ]
Yu, Guangchun [4 ]
Zhang, Likuan [1 ,2 ]
Lei, Yuhong [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Geol & Geophys, Key Lab Petr Resources Res, Beijing 100029, Peoples R China
[2] Chinese Acad Sci, Innovat Acad Earth Sci, Beijing 100029, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[4] SINOPEC, Explorat Branch Co, Chengdu 610041, Peoples R China
关键词
TOTAL ORGANIC-CARBON; SOURCE ROCKS; WELL LOGS; IDENTIFICATION; MODEL; RESISTIVITY; RICHNESS; POROSITY; TRENDS; MATTER;
D O I
10.1155/2021/6794213
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The total organic carbon content (TOC) is a core indicator for shale gas reservoir evaluations. Machine learning-based models can quickly and accurately predict TOC, which is of great significance for the production of shale gas. Based on conventional logs, the measured TOC values, and other data of 9 typical wells in the Jiaoshiba area of the Sichuan Basin, this paper performed a Bayesian linear regression and applied a random forest machine learning model to predict TOC values of the shale from the Wufeng Formation and the lower part of the Longmaxi Formation. The results showed that the TOC value prediction accuracy was improved by more than 50% by using the well-trained machine learning models compared with the traditional Delta LogR method in an overmature and tight shale. Using the halving random search cross-validation method to optimize hyperparameters can greatly improve the speed of building the model. Furthermore, excluding the factors that affect the log value other than the TOC and taking the corrected data as input data for training could improve the prediction accuracy of the random forest model by approximately 5%. Data can be easily updated with machine learning models, which is of primary importance for improving the efficiency of shale gas exploration and development.
引用
收藏
页数:13
相关论文
共 62 条
[1]  
Ahangari D., 2021, PREDICTION GEOCHEMIC, DOI DOI 10.1016/J.PETLM.2021.04.007
[2]  
[安鹏 An Peng], 2018, [地球物理学进展, Progress in Geophysiscs], V33, P1029
[3]  
BEERS RF, 1945, AAPG BULL, V29, P1
[4]  
Bishop Christopher M, 2006, PATTERN RECOGN, V128, P1, DOI [10.1117/1.2819119, DOI 10.1117/1]
[5]   Analyzing organic richness of source rocks from well log data by using SVM and ANN classifiers: A case study from the Kazhdumi formation, the Persian Gulf basin, offshore Iran [J].
Bolandi, Vahid ;
Kadkhodaie, Ali ;
Farzi, Reza .
JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2017, 151 :224-234
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   Cross-validation methods [J].
Browne, MW .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :108-132
[8]   The importance of the loss function in option valuation [J].
Christoffersen, P ;
Jacobs, K .
JOURNAL OF FINANCIAL ECONOMICS, 2004, 72 (02) :291-318
[9]   The upside of uncertainty: Identification of lithology contact zones from airborne geophysics and satellite data using random forests and support vector machines [J].
Cracknell, Matthew J. ;
Reading, Anya M. .
GEOPHYSICS, 2013, 78 (03) :WB113-WB126
[10]  
Draper N. R., 1998, APPL REGRESSION ANAL