MLatom 2: An Integrative Platform for Atomistic Machine Learning

被引:39
作者
Dral, Pavlo O. [1 ,2 ,3 ]
Ge, Fuchun [2 ,3 ]
Xue, Bao-Xin [1 ,2 ,3 ]
Hou, Yi-Fan [1 ,2 ,3 ]
Pinheiro, Max, Jr. [4 ]
Huang, Jianxing [1 ,2 ,3 ]
Barbatti, Mario [4 ]
机构
[1] Fujian Prov Key Lab Theoret & Computat Chem, State Key Lab Phys Chem Solid Surfaces, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Dept Chem, Xiamen 361005, Peoples R China
[3] Xiamen Univ, Coll Chem & Chem Engn, Xiamen 361005, Peoples R China
[4] Aix Marseille Univ, CNRS, ICR, Marseille, France
基金
欧洲研究理事会; 中国国家自然科学基金;
关键词
Machine learning; Quantum chemistry; Kernel ridge regression; Neural networks; Gaussian process regression; GAUSSIAN APPROXIMATION POTENTIALS; ZETA VALENCE QUALITY; MOLECULAR-DYNAMICS; BASIS-SETS; ENERGY SURFACES; ATOMS LI; ACCURACY; CHEMISTRY;
D O I
10.1007/s41061-021-00339-5
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Atomistic machine learning (AML) simulations are used in chemistry at an ever-increasing pace. A large number of AML models has been developed, but their implementations are scattered among different packages, each with its own conventions for input and output. Thus, here we give an overview of our MLatom 2 software package, which provides an integrative platform for a wide variety of AML simulations by implementing from scratch and interfacing existing software for a range of state-of-the-art models. These include kernel method-based model types such as KREG (native implementation), sGDML, and GAP-SOAP as well as neural-network-based model types such as ANI, DeepPot-SE, and PhysNet. The theoretical foundations behind these methods are overviewed too. The modular structure of MLatom allows for easy extension to more AML model types. MLatom 2 also has many other capabilities useful for AML simulations, such as the support of custom descriptors, farthest-point and structure-based sampling, hyperparameter optimization, model evaluation, and automatic learning curve generation. It can also be used for such multi-step tasks as Delta-learning, self-correction approaches, and absorption spectrum simulation within the machine-learning nuclear-ensemble approach. Several of these MLatom 2 capabilities are showcased in application examples.
引用
收藏
页数:41
相关论文
共 65 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] [Anonymous], 2011, ADV NEURAL INFORM PR
  • [3] [Anonymous], 1999, LAPACK Users' Guide, DOI [10.1137/1.9780898719604, DOI 10.1137/1.9780898719604]
  • [4] [Anonymous], 2013, MLatom: A Package for Atomistic Simulations with Machine Learning
  • [5] On the origin of the shift between vertical excitation and band maximum in molecular photoabsorption
    Bai, Shuming
    Mansour, Ritam
    Stojanovic, Ljiljana
    Toldo, Josene M.
    Barbatti, Mario
    [J]. JOURNAL OF MOLECULAR MODELING, 2020, 26 (05)
  • [6] Newton-X: a surface-hopping program for nonadiabatic molecular dynamics
    Barbatti, Mario
    Ruckenbauer, Matthias
    Plasser, Felix
    Pittner, Jiri
    Granucci, Giovanni
    Persico, Maurizio
    Lischka, Hans
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2014, 4 (01) : 26 - 33
  • [7] Barbatti Mario., 2019, Newton-x: a package for newtonian dynamics close to the crossing seam
  • [8] Gaussian approximation potentials: A brief tutorial introduction
    Bartok, Albert P.
    Csanyi, Gabor
    [J]. INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY, 2015, 115 (16) : 1051 - 1057
  • [9] On representing chemical environments
    Bartok, Albert P.
    Kondor, Risi
    Csanyi, Gabor
    [J]. PHYSICAL REVIEW B, 2013, 87 (18)
  • [10] Gaussian Approximation Potentials: The Accuracy of Quantum Mechanics, without the Electrons
    Bartok, Albert P.
    Payne, Mike C.
    Kondor, Risi
    Csanyi, Gabor
    [J]. PHYSICAL REVIEW LETTERS, 2010, 104 (13)