A framework of active learning and semi-supervised learning for lithology identification based on improved naive Bayes

被引:39
作者
Ren, Quan [1 ]
Zhang, Hongbing [1 ]
Zhang, Dailu [1 ]
Zhao, Xiang [1 ]
Yan, Lizhi [1 ]
Rui, Jianwen [2 ]
Zeng, Fanxin [1 ]
Zhu, Xinyi [1 ]
机构
[1] Hohai Univ, Sch Earth Sci & Engn, Nanjing 211100, Peoples R China
[2] Nanjing Vocat, Sch Artificial Intelligence, Coll Informat Technol, Nanjing, Peoples R China
基金
中国国家自然科学基金;
关键词
Lithology identification; Active learning; Semi-supervised learning; Logging data; Naive Bayes; PERMEABILITY; PREDICTION; CLASSIFIER; RESERVOIR;
D O I
10.1016/j.eswa.2022.117278
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lithology identification is the basis of energy exploration and reservoir evaluation, intelligent and accurate identification of underground lithology is a key issue. The establishment of a machine learning lithology identification model using logging data is a hot research direction in recent years. However, the logging data has a high degree of non-linearity and multi-response characteristics, and there are insufficient numbers of labeled samples in the training data set. These will eventually affect the modeling accuracy and may cause over-fitting. Therefore, a framework of active learning and semi-supervised learning for lithology identification based on improved naive Bayes (ALSLINB) is proposed. The contributions are fourfold: (i) The Gaussian mixture model (GMM) based on the EM algorithm is used to estimate the probability density of the log data, which fits the probability distribution of the nonlinear multi-response log data. (ii) A framework combining active learning (AL) and semi-supervised learning is proposed for the expansion of labeled samples in the training data set. (iii) The application of pseudo-labeling detection technology can effectively improve the authenticity of pseudo-label samples. (iv) Different from the general deterministic lithology identification method, the result of the ALSLINB algorithm corresponds to the probability score, which provides an auxiliary basis for the prediction result. Finally, the ALSLINB algorithm is applied to two different data sets for a large number of experiments and compared with the related baseline methods to verify its effectiveness and generalization ability. The result proves that the ALSLINB algorithm can complete the lithology recognition task well and has high accuracy and robustness, which provides a new direction for intelligent lithology identification.
引用
收藏
页数:14
相关论文
共 44 条
  • [11] Formation lithology classification using scalable gradient boosted decision trees
    Dev, Vikrant A.
    Eden, Mario R.
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2019, 128 : 392 - 404
  • [12] On the optimality of the simple Bayesian classifier under zero-one loss
    Domingos, P
    Pazzani, M
    [J]. MACHINE LEARNING, 1997, 29 (2-3) : 103 - 130
  • [13] Deep Recurrent Neural Networks Approach to Sedimentary Facies Classification Using Well Logs
    dos Santos, Daniel Theisges
    Roisenberg, Mauro
    Nascimento, Marivaldo dos Santos
    [J]. IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [14] Lithology identification and reservoir characteristics of the mixed siliciclastic-carbonate rocks of the lower third member of the Shahejie formation in the south of the Laizhouwan Sag, Bohai Bay Basin, China
    Duan, Yajun
    Xie, Jun
    Li, Baichuan
    Wang, Mengqi
    Zhang, Tianqi
    Zhou, Yawei
    [J]. CARBONATES AND EVAPORITES, 2020, 35 (02)
  • [15] Improved well log classification using semisupervised Gaussian mixture models and a new hyper-parameter selection strategy
    Dunham, Michael W.
    Malcolm, Alison
    Welford, J. Kim
    [J]. COMPUTERS & GEOSCIENCES, 2020, 140 (140)
  • [16] Uncertainty analysis in well log classification by Bayesian long short-term memory networks
    Feng, Runhai
    [J]. JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2021, 205
  • [17] Unsupervised learning of finite mixture models
    Figueiredo, MAT
    Jain, AK
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (03) : 381 - 396
  • [18] Bayesian network classifiers
    Friedman, N
    Geiger, D
    Goldszmidt, M
    [J]. MACHINE LEARNING, 1997, 29 (2-3) : 131 - 163
  • [19] A hybrid safe semi-supervised learning method
    Gan, Haitao
    Guo, Li
    Xia, Siyu
    Wang, Tao
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 149
  • [20] Guo YH, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P823