Well log data generation and imputation using sequence based generative adversarial networks

Cited by: 5
Authors
Al-Fakih, Abdulrahman [1]
Koeshidayatullah, A. [1]
Mukerji, Tapan [2, 3, 4]
Al-Azani, Sadam [5]
Kaka, SanLinn I. [1]
Affiliations
[1] King Fahd Univ Petr & Minerals, Coll Petr Engn & Geosci, Dhahran 31261, Saudi Arabia
[2] Stanford Univ, Dept Energy Sci & Engn, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Earth & Planetary Sci, Stanford, CA 94305 USA
[4] Stanford Univ, Dept Geophys, Stanford, CA 94305 USA
[5] King Fahd Univ Petr & Minerals, SDAIA KFUPM Joint Res Ctr Artificial Intelligence, Dhahran 31261, Saudi Arabia
Keywords
Generative adversarial networks models; Time series models; Sequence GAN models; Well log data imputation; Synthetic well log data generation; PREDICTION; RESERVOIR;
DOI
10.1038/s41598-025-95709-0
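Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]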
Subject classification code
07; 0710; 09
Abstract
Well log analysis is essential to hydrocarbon exploration, providing detailed insights into subsurface geological formations. However, gaps and inaccuracies in well log data, often due to equipment limitations, operational challenges, and harsh subsurface conditions, can introduce significant uncertainties in reservoir evaluation. Addressing these challenges requires effective methods for both synthetic data generation and precise imputation of missing data, ensuring data completeness and reliability. This study introduces a novel framework utilizing sequence-based generative adversarial networks (GANs) specifically designed for well log data generation and imputation. The framework integrates two distinct sequence-based GAN models: time series GAN (TSGAN) for generating synthetic well log data and sequence GAN (SeqGAN) for imputing missing data. Both models were tested on a dataset from the North Sea, Netherlands region. For the imputation task, the input comprises logs with missing values and the output is the corresponding imputed logs; for the synthetic data generation task, the input is complete real logs and the output is synthetic logs that mimic the statistical properties of the original data. All log measurements are normalized to a 0-1 range using min-max scaling, and error metrics are reported in these normalized units. Different sections of 5, 10, and 50 data points were used. Experimental results demonstrate that this approach achieves superior accuracy in filling data gaps compared to other deep learning models for spatial series analysis. The imputation method yielded R² values of 0.92, 0.86, and 0.57, with corresponding mean absolute percentage error (MAPE) values of 8.320, 0.005, and 166.6, and mean absolute error (MAE) values of 0.012, 0.002, and 0.03, respectively. The synthetic generation yielded an R² of 0.92, an MAE of 0.35, and an MRLE of 0.01. These results set a new benchmark for data integrity and utility in geosciences, particularly in well log data analysis.
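The min-max scaling and the error metrics named in the abstract (MAE, MAPE, R²) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the gamma-ray readings, and the imputed values are all hypothetical, and the metric definitions are the standard textbook ones, which the paper may or may not follow exactly.

```python
from statistics import mean


def min_max_scale(values):
    """Scale a list of log readings to the 0-1 range (min-max scaling)."""
    lo, hi = min(values), max(values)
    if hi == lo:
        raise ValueError("constant log: min-max scaling is undefined")
    return [(v - lo) / (hi - lo) for v in values]


def mae(true, pred):
    """Mean absolute error, in the same (normalized) units as the inputs."""
    return mean(abs(t - p) for t, p in zip(true, pred))


def mape(true, pred):
    """Mean absolute percentage error in percent; zero true values are skipped."""
    terms = [abs((t - p) / t) for t, p in zip(true, pred) if t != 0]
    return 100.0 * mean(terms)


def r2(true, pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    m = mean(true)
    ss_res = sum((t - p) ** 2 for t, p in zip(true, pred))
    ss_tot = sum((t - m) ** 2 for t in true)
    return 1.0 - ss_res / ss_tot


# Hypothetical gamma-ray readings (API units), invented for illustration only.
gamma_ray = [40.0, 55.0, 70.0, 85.0, 100.0]
scaled = min_max_scale(gamma_ray)

# Pretend samples 1..3 were missing and some model imputed them.
true_gap = scaled[1:4]
imputed_gap = [0.30, 0.50, 0.70]  # hypothetical model output

print("MAE :", mae(true_gap, imputed_gap))
print("MAPE:", mape(true_gap, imputed_gap))
print("R2  :", r2(true_gap, imputed_gap))
```

Because min-max scaling maps the smallest reading to exactly 0, percentage errors on values near zero can become very large even when the absolute error is small, so MAPE and MAE on normalized logs need not move together.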
Pages: 21