Optimization of machine learning approaches for shale gas production forecast

被引:21
作者
Wang, Muming [1 ]
Hui, Gang [1 ]
Pang, Yu [1 ]
Wang, Shuhua [1 ]
Chen, Shengnan [1 ]
机构
[1] Univ Calgary, Dept Chem & Petr Engn, Calgary, AB, Canada
来源
GEOENERGY SCIENCE AND ENGINEERING | 2023年 / 226卷
基金
加拿大自然科学与工程研究理事会;
关键词
Shale gas production; Machine learning; Model optimization; Overfitting; Sparrow search algorithm;
D O I
10.1016/j.geoen.2023.211719
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Successful production from shale gas formations has changed the energy outlook of North America. During such process, a significant amount of data related to production and operations has been accumulated, which can be analyzed with machine learning (ML) techniques for production forecasts and performance optimization. In this study, an optimization workflow is proposed for production forecast in shale gas reservoirs, focusing on mitigating the overfitting problem while maximizing its accuracy. More specifically, a new objective function is developed where a penalty term is added to mitigate the model's overfitting tendency. The kernel density estimate (KDE) method guarantees that a similar distribution is shared between the split training and testing subsets. Four machine learning algorithms, including XGBoost, LightGBM, CatBoost, and Random Forest, are utilized and further optimized by the sparrow search algorithm with the newly developed objective function. A stacking approach is finally established based on the optimized algorithm and selected meta-models. Results show that the R2 of all four models has increased to around 0.9 after the optimization process, where XGBoost and LightGBM perform slightly better than the rest. In addition, R2 of the training dataset for XGBoost was decreased from 0.999 to 0.937, while the R2 for the testing dataset was increased from 0.762 to 0.905. This indicates that the overfitting has been mitigated after the optimization with the gap between the training and testing datasets reduced from 0.237 to 0.032. Moreover, stacking the optimized ML algorithms can not further enhance its accuracy. The best performer with SVM as the meta-model can only increase the R2 to 0.910, yet the model complexity increases significantly.
引用
收藏
页数:9
相关论文
共 49 条
[1]  
Ahmadi MA, 2015, PETROLEUM-PRC, V1, P118, DOI [10.1016/j.petlm.2015.06.004, DOI 10.1016/J.PETLM.2015.06.004, 10.1016/j.petlm.2015.06.004]
[2]   ANALYSIS OF DECLINE CURVES [J].
ARPS, JJ .
TRANSACTIONS OF THE AMERICAN INSTITUTE OF MINING AND METALLURGICAL ENGINEERS, 1945, 160 :228-247
[3]   Prediction of the sulfur solubility in pure H2S and sour gas by intelligent models [J].
Bian, Xiao-Qiang ;
Song, Yi-Lun ;
Mwamukonda, Martin Kelvin ;
Fu, Yu .
JOURNAL OF MOLECULAR LIQUIDS, 2020, 299
[4]   Prediction of Wax Disappearance Temperature by Intelligent Models [J].
Bian, Xiao-Qiang ;
Huang, Jin-Hui ;
Wang, Ying ;
Liu, Yong-Bing ;
Kasthuriarachchi, Don Thisal Kaushika ;
Huang, Lin-Jun .
ENERGY & FUELS, 2019, 33 (04) :2934-2949
[5]   A deep-learning approach for reservoir evaluation for shale gas wells with complex fracture networks [J].
Chu, Hongyang ;
Dong, Peng ;
Lee, W. John .
ADVANCES IN GEO-ENERGY RESEARCH, 2023, 7 (01) :49-65
[6]  
Clark A.J., 2011, P SPE ANN TECHN C EX
[7]  
Dzeroski S, 2004, MACH LEARN, V54, P255, DOI 10.1023/B.MAC.0000015881.36452.6e
[8]   The role of aseismic slip in hydraulic fracturing-induced seismicity [J].
Eyre, Thomas S. ;
Eaton, David W. ;
Garagash, Dmitry I. ;
Zecevic, Megan ;
Venieri, Marco ;
Weir, Ronald ;
Lawton, Donald C. .
SCIENCE ADVANCES, 2019, 5 (08)
[9]  
Fox A.D., 2015, GeoConvention: New Horizon, P4
[10]   Fault and conduit controlled burial dolomitization of the Devonian west-central Alberta Deep Basin [J].
Green, DG ;
Mountjoy, EW .
BULLETIN OF CANADIAN PETROLEUM GEOLOGY, 2005, 53 (02) :101-129