Synthetic pre-training for neural-network interatomic potentials

Cited by: 10
Authors
Gardner, John L. A. [1]
Baker, Kathryn T. [1]
Deringer, Volker L. [1]
Affiliations
[1] Univ Oxford, Dept Chem, Inorgan Chem Lab, Oxford OX1 3QR, England
Source
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024 / Vol. 5 / Issue 01
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK; UK Research and Innovation (UKRI);
Keywords
machine learning; neural networks; synthetic data; atomistic simulations; molecular dynamics;
DOI
10.1088/2632-2153/ad1626
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Machine learning (ML) based interatomic potentials have transformed the field of atomistic materials modelling. However, ML potentials depend critically on the quality and quantity of quantum-mechanical reference data with which they are trained, and therefore developing datasets and training pipelines is becoming an increasingly central challenge. Leveraging the idea of 'synthetic' (artificial) data that is common in other areas of ML research, we here show that synthetic atomistic data, themselves obtained at scale with an existing ML potential, constitute a useful pre-training task for neural-network (NN) interatomic potential models. Once pre-trained with a large synthetic dataset, these models can be fine-tuned on a much smaller, quantum-mechanical one, improving numerical accuracy and stability in computational practice. We demonstrate feasibility for a series of equivariant graph-NN potentials for carbon, and we carry out initial experiments to test the limits of the approach.
Pages: 11