Probing the effects of broken symmetries in machine learning

被引:1
作者
Langer, Marcel F. [1 ]
Pozdnyakov, Sergey N. [1 ]
Ceriotti, Michele [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Inst Mat, Lab Computat Sci & Modeling, CH-1015 Lausanne, Switzerland
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 04期
基金
欧洲研究理事会; 瑞士国家科学基金会;
关键词
machine learning; symmetry-constrained models; atomistic modeling; molecular simulations; THERMOSTATS;
D O I
10.1088/2632-2153/ad86a0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Symmetry is one of the most central concepts in physics, and it is no surprise that it has also been widely adopted as an inductive bias for machine-learning models applied to the physical sciences. This is especially true for models targeting the properties of matter at the atomic scale. Both established and state-of-the-art approaches, with almost no exceptions, are built to be exactly equivariant to translations, permutations, and rotations of the atoms. Incorporating symmetries-rotations in particular-constrains the model design space and implies more complicated architectures that are often also computationally demanding. There are indications that unconstrained models can easily learn symmetries from data, and that doing so can even be beneficial for the accuracy of the model. We demonstrate that an unconstrained architecture can be trained to achieve a high degree of rotational invariance, testing the impacts of the small symmetry breaking in realistic scenarios involving simulations of gas-phase, liquid, and solid water. We focus specifically on physical observables that are likely to be affected-directly or indirectly-by non-invariant behavior under rotations, finding negligible consequences when the model is used in an interpolative, bulk, regime. Even for extrapolative gas-phase predictions, the model remains very stable, even though symmetry artifacts are noticeable. We also discuss strategies that can be used to systematically reduce the magnitude of symmetry breaking when it occurs, and assess their impact on the convergence of observables.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Machine learning models for predicting volumetric errors based on scale and master balls artefact probing data
    Zeng, Min
    Feng, Miao
    Mayer, J. R. R.
    Bitar-Nehme, Elie
    Duong, Xuan Truong
    CIRP JOURNAL OF MANUFACTURING SCIENCE AND TECHNOLOGY, 2025, 59 : 135 - 157
  • [42] A machine learning framework to adjust for learning effects in medical device safety evaluation
    Koola, Jejo D.
    Ramesh, Karthik
    Mao, Jialin
    Ahn, Minyoung
    Davis, Sharon E.
    Govindarajulu, Usha
    Perkins, Amy M.
    Westerman, Dax
    Ssemaganda, Henry
    Speroff, Theodore
    Ohno-Machado, Lucila
    Ramsay, Craig R.
    Sedrakyan, Art
    Resnic, Frederic S.
    Matheny, Michael E.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 32 (01) : 206 - 217
  • [43] Machine learning in the estimation of causal effects: targeted minimum loss-based estimation and double/debiased machine learning
    Diaz, Ivan
    BIOSTATISTICS, 2020, 21 (02) : 353 - 358
  • [44] The Machine Learning Machine: A Tangible User Interface for Teaching Machine Learning
    Kaspersen, Magnus Hoholt
    Bilstrup, Karl-Emil Kjaer
    Petersen, Marianne Graves
    PROCEEDINGS OF THE FIFTEENTH INTERNATIONAL CONFERENCE ON TANGIBLE, EMBEDDED, AND EMBODIED INTERACTION, TEI 2021, 2021,
  • [45] Machine learning in molecular simulations of biomolecules
    Guan, Xing-Yue
    Huang, Heng-Yan
    Peng, Hua-Qi
    Liu, Yan-Hang
    Li, Wen-Fei
    Wei, Wang
    ACTA PHYSICA SINICA, 2023, 72 (24)
  • [46] The Adverse Effects of Code Duplication in Machine Learning Models of Code
    Allamams, Miltiadis
    PROCEEDINGS OF THE 2019 ACM SIGPLAN INTERNATIONAL SYMPOSIUM ON NEW IDEAS, NEW PARADIGMS, AND REFLECTIONS ON PROGRAMMING AND SOFTWARE (ONWARD!' 19), 2019, : 143 - 153
  • [47] Preliminary Evaluation of Search Space Characterization and Effects of Machine Learning
    Ghelarducci, Leo A.
    Garfield, Keith
    IEEE SOUTHEASTCON 2020, 2020,
  • [48] The Effects of Example-Based Explanations in a Machine Learning Interface
    Cai, Carrie J.
    Jongejan, Jonas
    Holbrook, Jess
    PROCEEDINGS OF IUI 2019, 2019, : 258 - 262
  • [49] The effects of data quality on machine learning performance on tabular data
    Mohammed, Sedir
    Budach, Lukas
    Feuerpfeil, Moritz
    Ihde, Nina
    Nathansen, Andrea
    Noack, Nele
    Patzlaff, Hendrik
    Naumann, Felix
    Harmouch, Hazar
    INFORMATION SYSTEMS, 2025, 132
  • [50] DrugClust: A machine learning approach for drugs side effects prediction
    Dimitri, Giovanna Maria
    Lio, Pietro
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2017, 68 : 204 - 210