A machine learning approach for corrosion small datasets

被引:52
|
作者
Sutojo, Totok [1 ,2 ]
Rustad, Supriadi [1 ,2 ]
Akrom, Muhamad [1 ,3 ]
Syukur, Abdul [2 ]
Shidik, Guruh Fajar [2 ]
Dipojono, Hermawan Kresno [3 ]
机构
[1] Dian Nuswantoro Univ, Fac Comp Sci, Res Ctr Mat Informat, Semarang 50131, Indonesia
[2] Dian Nuswantoro Univ, Fac Comp Sci, Doctoral Program Comp Sci, Semarang 50131, Indonesia
[3] Bandung Inst Technol, Adv Funct Mat Res Grp, Bandung 40132, Indonesia
关键词
INHIBITION EFFICIENCY; BENZIMIDAZOLE DERIVATIVES; GAS-INDUSTRY; MILD-STEEL; PREDICTION; DESIGN; MODEL; OIL;
D O I
10.1038/s41529-023-00336-7
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In this work, we developed a QSAR model using the K-Nearest Neighbor (KNN) algorithm to predict the corrosion inhibition performance of the inhibitor compound. To overcome the small dataset problems, virtual samples are generated and added to the training set using a Virtual Sample Generation (VSG) method. The generalizability of the proposed KNN + VSG model is verified by using six small datasets from references and comparing their prediction performances. The research shows that for the six datasets, the proposed model is able to make predictions with the best accuracy. Adding virtual samples to the training data helps the algorithm recognize feature-target relationship patterns, and therefore increases the number of chemical quantum parameters correlated with corrosion inhibition efficiency. This proposed method strengthens the prospect of ML for developing material designs, especially in the case of small datasets.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] A machine learning approach for corrosion small datasets
    Totok Sutojo
    Supriadi Rustad
    Muhamad Akrom
    Abdul Syukur
    Guruh Fajar Shidik
    Hermawan Kresno Dipojono
    npj Materials Degradation, 7
  • [2] An Approach for Validating Quality of Datasets for Machine Learning
    Ding, Junhua
    Li, XinChuan
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 2795 - 2803
  • [3] A refined approach for evaluating small datasets via binary classification using machine learning
    Steinert, Steffen
    Ruf, Verena
    Dzsotjan, David
    Grossmann, Nicolas
    Schmidt, Albrecht
    Kuhn, Jochen
    Kuechemann, Stefan
    PLOS ONE, 2024, 19 (05):
  • [4] Machine Learning Methods with Noisy, Incomplete or Small Datasets
    Caiafa, Cesar F.
    Sun, Zhe
    Tanaka, Toshihisa
    Marti-Puig, Pere
    Sole-Casals, Jordi
    APPLIED SCIENCES-BASEL, 2021, 11 (09):
  • [5] Advancing flame retardant prediction: A self-enforcing machine learning approach for small datasets
    Yan, Cheng
    Lin, Xiang
    Feng, Xiaming
    Yang, Hongyu
    Mensah, Patrick
    Li, Guoqiang
    APPLIED PHYSICS LETTERS, 2023, 122 (25)
  • [6] A strategy to apply machine learning to small datasets in materials science
    Ying Zhang
    Chen Ling
    npj Computational Materials, 4
  • [7] Decomposition Methods for Machine Learning with Small, Incomplete or Noisy Datasets
    Caiafa, Cesar Federico
    Sole-Casals, Jordi
    Marti-Puig, Pere
    Zhe, Sun
    Tanaka, Toshihisa
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 20
  • [8] Simple Baseline Machine Learning Text Classifiers for Small Datasets
    Riekert M.
    Riekert M.
    Klein A.
    SN Computer Science, 2021, 2 (3)
  • [9] A strategy to apply machine learning to small datasets in materials science
    Zhang, Ying
    Ling, Chen
    NPJ COMPUTATIONAL MATERIALS, 2018, 4
  • [10] Transfer Learning Approach for Indoor Localization with Small Datasets
    Yoon, Jeonghyeon
    Oh, Jisoo
    Kim, Seungku
    REMOTE SENSING, 2023, 15 (08)