An automatic methodology for lithology identification in a tight sandstone reservoir using a bidirectional long short-term memory network combined with Borderline-SMOTE

被引:1
作者
Hu, Chong [1 ]
Deng, Rui [1 ,2 ]
Hu, Xueyi [1 ]
He, Mengcheng [1 ]
Zhao, Hui [3 ]
Jiang, Xuemeng [3 ]
机构
[1] Yangtze Univ, Key Lab Oil & Gas Resources & Explorat Technol, Minist Educ, Wuhan 430100, Hubei, Peoples R China
[2] China Natl Logging Corp, Xian, Shaanxi, Peoples R China
[3] PetroChina Qinghai Oilfield Co, Dunhuang 736202, Gansu, Peoples R China
关键词
Tight sandstone reservoirs; Lithology identification; Bidirectional long short-term memory; Imbalanced class; Deep neural network; NEURAL-NETWORK; CLASSIFICATION;
D O I
10.1007/s11600-024-01492-3
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The increasing difficulty in conventional oil and gas exploration and development has brought tight sandstone reservoirs into focus. These reservoirs have a variety of lithofacies types and strong heterogeneity. Accurately identifying the complex lithology of tight sandstone reservoirs is crucial for locating favorable reservoirs and guiding oil and gas exploration and development. In this study, we propose a hybrid method for imbalanced lithology identification. This method combines a bidirectional long short-term memory (BiLSTM) deep neural network model with Borderline-SMOTE. The Borderline-SMOTE oversampling method handles class-imbalanced data by region, generating new minority class samples in a targeted manner, balancing the data and reducing noise interference. The BiLSTM model integrates information from the upper and lower surrounding rock strata, capturing complex features of continuous sequence data and accurately identifying small interbedded lithologies within long sections of mudstone or main sandstone. We compared our model with support vector machines and neural network models using actual logging data and the public dataset. We used accuracy, precision, recall, and F beta score as evaluation metrics. The results show that our method performs well on imbalanced data, with higher recall and F beta score than other models. It also demonstrates superior robustness on public datasets, showing potential for practical application.
引用
收藏
页码:2319 / 2335
页数:17
相关论文
共 52 条
  • [1] Akkurt R, 2019, P SPE ANN TECHN C EX, DOI [10.2118/196178-MS, DOI 10.2118/196178-MS]
  • [2] Al-Selwi S. M., 2023, J. Adv. Res. Appl. Sci. Eng. Technol, V30, P16, DOI [10.37934/araset.30.3.1631, DOI 10.37934/ARASET.30.3.1631]
  • [3] Interpretable machine learning with tree-based shapley additive explanations: Application to metabolomics datasets for binary classification
    Bifarin, Olatomiwa O.
    [J]. PLOS ONE, 2023, 18 (05):
  • [4] Data-driven predictions of a multiscale Lorenz 96 chaotic system using machine-learning methods: reservoir computing, artificial neural network, and long short-term memory network
    Chattopadhyay, Ashesh
    Hassanzadeh, Pedram
    Subramanian, Devika
    [J]. NONLINEAR PROCESSES IN GEOPHYSICS, 2020, 27 (03) : 373 - 389
  • [5] Chawla NV, 2010, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, SECOND EDITION, P875, DOI 10.1007/978-0-387-09823-4_45
  • [6] BILSTM-Based Deep Neural Network for Rock-Mass Classification Prediction Using Depth-Sequence MWD Data: A Case Study of a Tunnel in Yunnan, China
    Cheng, Xu
    Tang, Hua
    Wu, Zhenjun
    Liang, Dongcai
    Xie, Yachen
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [7] Dey II., 2023, Proc Art, DOI [10.1109/icsmdi57622.2023.00060, DOI 10.1109/ICSMDI57622.2023.00060]
  • [8] Dong QL, 2022, CHIN CONTR CONF, P3196, DOI 10.23919/CCC55666.2022.9902406
  • [9] A modular approach for multilingual timex detection and normalization using deep learning and grammar-based methods
    Escribano, Nayla
    Rigau, German
    Agerri, Rodrigo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 273
  • [10] Fandi F., 2023, JIP (Jurnal Informatika Polinema), VX, P331, DOI [10.33795/jip.v9i3.1330, DOI 10.33795/JIP.V9I3.1330]