Learning from mistakes: Sampling strategies to efficiently train machine learning models for material property prediction

被引:13
作者
Magar, Rishikesh [1 ]
Farimani, Amir Barati [1 ]
机构
[1] Carnegie Mellon Univ, Dept Mech Engn, Pittsburgh, PA 15213 USA
基金
美国安德鲁·梅隆基金会;
关键词
Machine learning; Sampling algorithms; DISCOVERY; NETWORKS;
D O I
10.1016/j.commatsci.2023.112167
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Recent advances in machine learning (ML) based methodologies have accelerated the prediction of the physical properties of materials. These ML models, however, rely on large amounts of simulated or experimental data to make a reliable prediction. This dependence on large amounts of data can be a roadblock to building ML models since collecting the data is prohibitively expensive and time-consuming. In this work, we propose two sampling strategies to reliably train machine learning models in the lowest amounts of data. Our algorithms alleviate the need to generate large datasets to train machine learning models. We demonstrate the effectiveness of these sampling strategies by improving the performance of Crystal Graph Convolutional Neural Network (CGCNN) on four different datasets. Using the proposed strategies, we can reach the benchmark performance of CGCNN models in fewer data samples.
引用
收藏
页数:8
相关论文
共 50 条
[31]   Machine Learning of Surface Layer Property Prediction for Milling Operations [J].
Uhlmann, Eckart ;
Holznagel, Tobias ;
Schehl, Philipp ;
Bode, Yannick .
JOURNAL OF MANUFACTURING AND MATERIALS PROCESSING, 2021, 5 (04)
[32]   Reservoir Property Prediction in the North Sea Using Machine Learning [J].
Al-Fakih, Abdulrahman ;
Kaka, Sanlinn I. ;
Koeshidayatullah, Ardiansyah I. .
IEEE ACCESS, 2023, 11 :140148-140160
[33]   Forecasting tuberculosis incidence: a review of time series and machine learning models for prediction and eradication strategies [J].
Maipan-Uku, Jamilu Yahaya ;
Cavus, Nadire .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL HEALTH RESEARCH, 2025, 35 (03) :645-660
[34]   Machine Learning in Membrane Design: From Property Prediction to AI-Guided Optimization [J].
Cao, Zhonglin ;
Farimani, Omid Barati ;
Ock, Janghoon ;
Farimani, Amir Barati .
NANO LETTERS, 2024, 24 (10) :2953-2960
[35]   Machine Learning and Deep Learning Models for Early Sepsis Prediction: A Scoping Review [J].
Shanmugam, Hemalatha ;
Airen, Lavanya ;
Rawat, Saumya .
INDIAN JOURNAL OF CRITICAL CARE MEDICINE, 2025, 29 (06) :516-524
[36]   Machine Learning and Deep Learning Models for Dengue Diagnosis Prediction: A Systematic Review [J].
Giron, Daniel Cristobal Andrade ;
Rodriguez, William Joel Marin ;
Lioo-Jordan, Flor de Maria ;
Sanchez, Jose Luis Ausejo .
INFORMATICS-BASEL, 2025, 12 (01)
[37]   Machine Learning Models for Early Dengue Severity Prediction [J].
Caicedo-Torres, William ;
Paternina, Angel ;
Pinzon, Hernando .
ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2016, 2016, 10022 :247-258
[38]   Machine learning models for prognosis prediction in endodontic microsurgery [J].
Qu, Yang ;
Lin, Zhenzhe ;
Yang, Zhaojing ;
Lin, Haotian ;
Huang, Xiangya ;
Gu, Lisha .
JOURNAL OF DENTISTRY, 2022, 118
[39]   Application of machine learning ensemble models for rainfall prediction [J].
Ahmadi, Hasan ;
Aminnejad, Babak ;
Sabatsany, Hojat .
ACTA GEOPHYSICA, 2023, 71 (04) :1775-1786
[40]   End-to-end material thermal conductivity prediction through machine learning [J].
Srivastava, Yagyank ;
Jain, Ankit .
JOURNAL OF APPLIED PHYSICS, 2023, 134 (22)