Learning from mistakes: Sampling strategies to efficiently train machine learning models for material property prediction

被引:11
作者
Magar, Rishikesh [1 ]
Farimani, Amir Barati [1 ]
机构
[1] Carnegie Mellon Univ, Dept Mech Engn, Pittsburgh, PA 15213 USA
基金
美国安德鲁·梅隆基金会;
关键词
Machine learning; Sampling algorithms; DISCOVERY; NETWORKS;
D O I
10.1016/j.commatsci.2023.112167
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recent advances in machine learning (ML) based methodologies have accelerated the prediction of the physical properties of materials. These ML models, however, rely on large amounts of simulated or experimental data to make a reliable prediction. This dependence on large amounts of data can be a roadblock to building ML models since collecting the data is prohibitively expensive and time-consuming. In this work, we propose two sampling strategies to reliably train machine learning models in the lowest amounts of data. Our algorithms alleviate the need to generate large datasets to train machine learning models. We demonstrate the effectiveness of these sampling strategies by improving the performance of Crystal Graph Convolutional Neural Network (CGCNN) on four different datasets. Using the proposed strategies, we can reach the benchmark performance of CGCNN models in fewer data samples.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Machine Learning-Assisted Property Prediction of Solid-State Electrolyte
    Li, Jin
    Zhou, Meisa
    Wu, Hong-Hui
    Wang, Lifei
    Zhang, Jian
    Wu, Naiteng
    Pan, Kunming
    Liu, Guilong
    Zhang, Yinggan
    Han, Jiajia
    Liu, Xianming
    Chen, Xiang
    Wan, Jiayu
    Zhang, Qiaobao
    ADVANCED ENERGY MATERIALS, 2024, 14 (20)
  • [22] Relationship of structure and mechanical property of silica with enhanced sampling and machine learning
    Deng, Yuanpeng
    Du, Tao
    Li, Hui
    JOURNAL OF THE AMERICAN CERAMIC SOCIETY, 2021, 104 (08) : 3910 - 3920
  • [23] Negative sampling strategies impact the prediction of scale-free biomolecular network interactions with machine learning
    Pengpai Li
    Bowen Shao
    Guoqing Zhao
    Zhi-Ping Liu
    BMC Biology, 23 (1)
  • [24] Machine Learning of Surface Layer Property Prediction for Milling Operations
    Uhlmann, Eckart
    Holznagel, Tobias
    Schehl, Philipp
    Bode, Yannick
    JOURNAL OF MANUFACTURING AND MATERIALS PROCESSING, 2021, 5 (04):
  • [25] Reservoir Property Prediction in the North Sea Using Machine Learning
    Al-Fakih, Abdulrahman
    Kaka, Sanlinn I.
    Koeshidayatullah, Ardiansyah I.
    IEEE ACCESS, 2023, 11 : 140148 - 140160
  • [26] Forecasting tuberculosis incidence: a review of time series and machine learning models for prediction and eradication strategies
    Maipan-Uku, Jamilu Yahaya
    Cavus, Nadire
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL HEALTH RESEARCH, 2025, 35 (03) : 645 - 660
  • [27] Machine Learning in Membrane Design: From Property Prediction to AI-Guided Optimization
    Cao, Zhonglin
    Farimani, Omid Barati
    Ock, Janghoon
    Farimani, Amir Barati
    NANO LETTERS, 2024, 24 (10) : 2953 - 2960
  • [28] Machine Learning and Deep Learning Models for Dengue Diagnosis Prediction: A Systematic Review
    Giron, Daniel Cristobal Andrade
    Rodriguez, William Joel Marin
    Lioo-Jordan, Flor de Maria
    Sanchez, Jose Luis Ausejo
    INFORMATICS-BASEL, 2025, 12 (01):
  • [29] Application of machine learning ensemble models for rainfall prediction
    Ahmadi, Hasan
    Aminnejad, Babak
    Sabatsany, Hojat
    ACTA GEOPHYSICA, 2023, 71 (04) : 1775 - 1786
  • [30] Machine Learning Models for Early Dengue Severity Prediction
    Caicedo-Torres, William
    Paternina, Angel
    Pinzon, Hernando
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 : 247 - 258