Optimal design of experiments in the context of machine-learning inter-atomic potentials: improving the efficiency and transferability of kernel based methods

Times cited: 0
Authors
Barzdajn, Bartosz [1]
Race, Christopher [2]
Affiliations
[1] Univ Manchester, Oxford Rd, Manchester M13 9PL, England
[2] Univ Sheffield, Western Bank, Sheffield S10 2TN, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
interatomic potentials; machine learning; optimal design; material science; GAP; TOTAL-ENERGY CALCULATIONS;
DOI
10.1088/1361-651X/ada050
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
Data-driven machine learning (ML) models of atomistic interactions are often based on flexible and non-physical functions that can relate nuanced aspects of atomic arrangements to predictions of energies and forces. As a result, these potentials are only as good as the training data (usually the results of so-called ab initio simulations), and we need to ensure that we have enough information to make a model sufficiently accurate, reliable and transferable. The main challenge stems from the fact that descriptors of chemical environments are often sparse, high-dimensional objects without a well-defined continuous metric. Therefore, it is rather unlikely that any ad hoc method for selecting training examples will be indiscriminate, and it is easy to fall into the trap of confirmation bias, where the same narrow and biased sampling is used to generate training and test sets. We will show that an approach derived from classical concepts of statistical planning of experiments and optimal design can help to mitigate such problems at a relatively low computational cost. The key feature of the method we will investigate is that it allows us to assess the quality of the data without obtaining reference energies and forces, a so-called offline approach. In other words, we are focusing on an approach that is easy to implement and does not require sophisticated frameworks that involve automated access to high performance computing.
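As a rough illustration of what an offline selection step can look like, the sketch below picks training candidates using only their descriptor vectors, with no reference energies or forces. It uses a greedy farthest-point heuristic for a max-min distance design, a classical space-filling criterion. The abstract does not state which design criterion the authors use, so the criterion, the Euclidean distance, the function names `greedy_maxmin` and `min_pairwise_distance`, and the synthetic two-dimensional "descriptors" are all assumptions made for illustration only.

```python
import math
import random


def greedy_maxmin(points, n_select, start=0):
    """Greedy farthest-point heuristic for a max-min distance design.

    Returns indices into `points`. Only the candidate coordinates are used;
    no labels (energies or forces) are needed, which is what makes it offline.
    """
    if not 0 < n_select <= len(points):
        raise ValueError("n_select must be between 1 and len(points)")
    selected = [start]
    # Distance from every candidate to its nearest already-selected point.
    nearest = [math.dist(p, points[start]) for p in points]
    while len(selected) < n_select:
        # Take the candidate whose nearest selected neighbour is farthest away.
        nxt = max(range(len(points)), key=nearest.__getitem__)
        selected.append(nxt)
        for i, p in enumerate(points):
            nearest[i] = min(nearest[i], math.dist(p, points[nxt]))
    return selected


def min_pairwise_distance(points, indices):
    """Smallest distance between any two selected points (the max-min score)."""
    return min(
        math.dist(points[i], points[j])
        for a, i in enumerate(indices)
        for j in indices[a + 1:]
    )


# Hypothetical stand-in for descriptor vectors: a dense cluster, mimicking
# narrow sampling, plus three rarely visited configurations.
rng = random.Random(0)
candidates = [(rng.gauss(0, 0.1), rng.gauss(0, 0.1)) for _ in range(200)]
candidates += [(1.0, 1.0), (-1.0, 1.0), (1.0, -1.0)]

chosen = greedy_maxmin(candidates, 4)
naive = [0, 1, 2, 3]  # "first four frames", an ad hoc selection
print("max-min selection:", chosen, min_pairwise_distance(candidates, chosen))
print("ad hoc selection: ", naive, min_pairwise_distance(candidates, naive))
```

On this toy data the max-min selection picks up the three isolated configurations that an ad hoc "first few frames" choice misses. Real descriptors of chemical environments are high-dimensional and, as the abstract notes, may lack a well-defined continuous metric, so the choice of distance here is the weakest assumption in the sketch.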
Pages: 20