Synthetic pre-training for neural-network interatomic potentials

被引:10
作者
Gardner, John L. A. [1 ]
Baker, Kathryn T. [1 ]
Deringer, Volker L. [1 ]
机构
[1] Univ Oxford, Dept Chem, Inorgan Chem Lab, Oxford OX1 3QR, England
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 01期
基金
英国科研创新办公室; 英国工程与自然科学研究理事会;
关键词
machine learning; neural networks; synthetic data; atomistic simulations; molecular dynamics;
D O I
10.1088/2632-2153/ad1626
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning (ML) based interatomic potentials have transformed the field of atomistic materials modelling. However, ML potentials depend critically on the quality and quantity of quantum-mechanical reference data with which they are trained, and therefore developing datasets and training pipelines is becoming an increasingly central challenge. Leveraging the idea of 'synthetic' (artificial) data that is common in other areas of ML research, we here show that synthetic atomistic data, themselves obtained at scale with an existing ML potential, constitute a useful pre-training task for neural-network (NN) interatomic potential models. Once pre-trained with a large synthetic dataset, these models can be fine-tuned on a much smaller, quantum-mechanical one, improving numerical accuracy and stability in computational practice. We demonstrate feasibility for a series of equivariant graph-NN potentials for carbon, and we carry out initial experiments to test the limits of the approach.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] NEURAL-NETWORK ARCHITECTURES FOR INDUSTRIAL APPLICATIONS
    DEVENA, L
    MASTRETTA, M
    RICCIARDIELLO, L
    BIOSENSORS & BIOELECTRONICS, 1995, 10 (1-2) : 231 - 236
  • [42] A NEURAL-NETWORK APPROACH TO FREEWAY NETWORK TRAFFIC CONTROL
    PAPAGEORGIOU, M
    MESSMER, A
    AZEMA, J
    DREWANZ, D
    CONTROL ENGINEERING PRACTICE, 1995, 3 (12) : 1719 - 1726
  • [43] 2D-diffractogram analysis: Kinematic-diffraction simulator for neural-network training-data generation
    Mehdi, Redad
    Chawla, Rounak
    Barcelos, Erika I.
    Willard, Matthew A.
    French, Roger H.
    Ernst, Frank
    COMPUTATIONAL MATERIALS SCIENCE, 2025, 252
  • [44] BEATS(sic): Audio Pre-Training with Acoustic Tokenizers
    Chen, Sanyuan
    Wu, Yu
    Wang, Chengyi
    Liu, Shujie
    Tompkins, Daniel
    Chen, Zhuo
    Che, Wanxiang
    Yu, Xiangzhan
    Wei, Furu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
  • [45] Malbert: A novel pre-training method for malware detection
    Xu, Zhifeng
    Fang, Xianjin
    Yang, Gaoming
    COMPUTERS & SECURITY, 2021, 111
  • [46] Fractals as Pre-Training Datasets for Anomaly Detection and Localization
    Ugwu, Cynthia I.
    Caruso, Emanuele
    Lanz, Oswald
    FRACTAL AND FRACTIONAL, 2024, 8 (11)
  • [47] High-Accuracy Neural Network Interatomic Potential for Silicon Nitride
    Xu, Hui
    Li, Zeyuan
    Zhang, Zhaofu
    Liu, Sheng
    Shen, Shengnan
    Guo, Yuzheng
    NANOMATERIALS, 2023, 13 (08)
  • [48] Nanoparticle Detection on SEM Images Using a Neural Network and Semi-Synthetic Training Data
    Lopez Gutierrez, Jorge David
    Abundez Barrera, Itzel Maria
    Torres Gomez, Nayely
    NANOMATERIALS, 2022, 12 (11)
  • [49] REPRESENTING AND LEARNING DISTRIBUTIONS WITH THE AID OF A NEURAL-NETWORK
    HURRION, RD
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 1993, 44 (10) : 1013 - 1023
  • [50] A neural-network based model of bioreaction kinetics
    Saxen, B
    Saxen, H
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 1996, 74 (01) : 124 - 131