Prediction of TOC Content in Organic-Rich Shale Using Machine Learning Algorithms: Comparative Study of Random Forest, Support Vector Machine, and XGBoost

Cited by: 17
Authors
Sun, Jiangtao [1]
Dang, Wei [1,2]
Wang, Fengqin [1,2]
Nie, Haikuan [3]
Wei, Xiaoliang [4,5]
Li, Pei [3]
Zhang, Shaohua [1,2]
Feng, Yubo [1]
Li, Fei [1]
Affiliations
[1] Xian Shiyou Univ, Sch Earth Sci & Engn, Xian 710065, Peoples R China
[2] Xian Shiyou Univ, Shaanxi Key Lab Petr Accumulat Geol, Xian 710065, Peoples R China
[3] SINOPEC, Petr Explorat & Prod Res Inst, Beijing 100083, Peoples R China
[4] SINOPEC, Explorat & Dev Inst Shengli Oilfield Co, Dongying 257000, Peoples R China
[5] China Univ Geosci, Key Lab Strategy Evaluat Shale Gas, Minist Land & Resources, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
TOC content; random forest; support vector machine; XGBoost; organic-rich shale; APPALACHIAN DEVONIAN SHALES; TRANSITIONAL BLACK SHALES; NORTH CHINA BASIN; ORDOS BASIN; WELL LOGS; LACUSTRINE SHALES; NEURAL-NETWORKS; SOURCE ROCKS; MATTER; CARBON;
DOI
10.3390/en16104159
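Chinese Library Classification
TE [Petroleum and Natural Gas Industry]; TK [Energy and Power Engineering];
Discipline classification codes
0807; 0820;
Abstract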
The total organic carbon (TOC) content of organic-rich shale is a key parameter in screening for potential source rocks and sweet spots of shale oil/gas. Traditional methods of determining the TOC content have clear drawbacks: geochemical experiments are costly and inefficient, while empirical mathematical regression methods are not universally applicable and have low accuracy. In this study, we propose three machine learning models, random forest (RF), support vector regression (SVR), and XGBoost, to predict the TOC content from well logs, and the performance of each model is compared with that of the traditional empirical methods. First, the decision tree algorithm is used to identify the optimal set of well logs from a total of 15. Then, 816 data points of well logs and TOC content collected from five different shale formations are used to train and test the three models. Finally, the accuracy of the three models is validated by predicting the unknown TOC content of a shale oil well. The results show that the RF model provides the best prediction of the TOC content, with R² = 0.915, MSE = 0.108, and MAE = 0.252, followed by XGBoost, while SVR gives the lowest predictive accuracy. Nevertheless, all three machine learning models outperform traditional empirical methods such as the Schmoker gamma-ray log method, the multiple linear regression method, and the ΔlgR method. Overall, the proposed machine learning models are powerful tools for predicting the TOC content of shale and for improving oil/gas exploration efficiency across different formations and basins.
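The abstract ranks the models by three standard regression metrics: the coefficient of determination (R²), the mean squared error (MSE), and the mean absolute error (MAE). The following is a minimal sketch of how these metrics are conventionally computed from measured and predicted TOC values. It is not the authors' code; the function names and the sample values are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def mse(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean squared error: average of squared residuals."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mae(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Mean absolute error: average of absolute residuals."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)


def r2(y_true: Sequence[float], y_pred: Sequence[float]) -> float:
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical measured vs. predicted TOC values (wt%), for illustration only.
toc_measured = [1.0, 2.0, 3.0, 4.0]
toc_predicted = [1.5, 2.0, 2.5, 4.0]

print(f"R2  = {r2(toc_measured, toc_predicted):.3f}")
print(f"MSE = {mse(toc_measured, toc_predicted):.3f}")
print(f"MAE = {mae(toc_measured, toc_predicted):.3f}")
```

A higher R² together with lower MSE and MAE indicates a closer fit, which is the sense in which the abstract reports RF as the best of the three models.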
Pages: 26