Chemical space-informed machine learning models for rapid predictions of x-ray photoelectron spectra of organic molecules

被引:0
|
作者
Tripathy, Susmita [1 ]
Das, Surajit [1 ]
Jindal, Shweta [1 ]
Ramakrishnan, Raghunathan [1 ]
机构
[1] Tata Inst Fundamental Res, Hyderabad 500046, India
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2024年 / 5卷 / 04期
关键词
x-ray photoelectron spectra; core-electron binding energy; density functional theory; machine learning; chemical space; LEVEL BINDING-ENERGIES; QUANTUM-CHEMISTRY; XPS SPECTRA; APPROXIMATION; SPECTROSCOPY; STATES; ATOMS;
D O I
10.1088/2632-2153/ad871d
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present machine learning models based on kernel-ridge regression for predicting x-ray photoelectron spectra of organic molecules originating from the K-shell ionization energies of carbon (C), nitrogen (N), oxygen (O), and fluorine (F) atoms. We constructed the training dataset through high-throughput calculations of K-shell core-electron binding energies (CEBEs) for 12 880 small organic molecules in the bigQM7 omega dataset, employing the Delta-SCF formalism coupled with meta-GGA-DFT and a variationally converged basis set. The models are cost-effective, as they require the atomic coordinates of a molecule generated using universal force fields while estimating the target-level CEBEs corresponding to DFT-level equilibrium geometry. We explore transfer learning by utilizing the atomic environment feature vectors learned using a graph neural network framework in kernel-ridge regression. Additionally, we enhance accuracy within the Delta-machine learning framework by leveraging inexpensive baseline spectra derived from Kohn-Sham eigenvalues. When applied to 208 combinatorially substituted uracil molecules larger than those in the training set, our analyses suggest that the models may not provide quantitatively accurate predictions of CEBEs but offer a strong linear correlation relevant for virtual high-throughput screening. We present the dataset and models as the Python module, cebeconf, to facilitate further explorations.
引用
收藏
页数:17
相关论文
共 41 条
  • [1] Theoretical assessment of vibrationally resolved C1s X-ray photoelectron spectra of simple cyclic molecules
    Hua, Weijie
    Tian, Guangjun
    Luo, Yi
    PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2020, 22 (35) : 20014 - 20026
  • [2] X-RAY PHOTOELECTRON SPECTRA STRUCTURE AND CHEMICAL BONDING IN AmO2
    Teterin, Yury A.
    Maslakov, Konstantin I.
    Ryzhkov, Mikhail V.
    Teterin, Anton Yu
    Ivanov, Kirill E.
    Kalmykov, Stepan N.
    Petrov, Vladimir G.
    NUCLEAR TECHNOLOGY & RADIATION PROTECTION, 2015, 30 (02) : 83 - 98
  • [3] Approaching multiplet splitting in X-ray photoelectron spectra by density functional theory methods: NO and O2 molecules as examples
    Sousa, Carmen
    Bagus, Paul S.
    Illas, Francesc
    CHEMICAL PHYSICS LETTERS, 2019, 731
  • [4] X-ray photoelectron spectra structure and chemical bond nature in NpO2
    Teterin, Yu. A.
    Teterin, A. Yu.
    Ivanov, K. E.
    Ryzhkov, M. V.
    Maslakov, K. I.
    Kalmykov, St. N.
    Petrov, V. G.
    Enina, D. A.
    PHYSICAL REVIEW B, 2014, 89 (03)
  • [5] A combined quantum chemical/molecular dynamics study of X-ray photoelectron spectra of polyvinyl alcohol using oligomer models
    Ehlert, Christopher
    Kroener, Dominik
    Saalfrank, Peter
    JOURNAL OF ELECTRON SPECTROSCOPY AND RELATED PHENOMENA, 2015, 199 : 38 - 45
  • [6] Automatic Identification of X-ray Absorption Fine Structure Spectra via Machine Learning
    Miyasaka, Naotoshi
    Gracia-Escobar, Fernando
    Takahashi, Keisuke
    JOURNAL OF PHYSICAL CHEMISTRY C, 2024, 128 (42) : 17921 - 17927
  • [7] Naive data mining and machine learning for high resolution, sparse x-ray spectra
    Teti, Emily S.
    Salazar, Sebastian
    Carpenter, Matthew H.
    APPLICATIONS OF MACHINE LEARNING 2022, 2022, 12227
  • [8] "Inverting" X-ray Absorption Spectra of Catalysts by Machine Learning in Search for Activity Descriptors
    Timoshenko, Janis
    Frenkel, Anatoly I.
    ACS CATALYSIS, 2019, 9 (11): : 10192 - 10211
  • [9] Curve fitting complex X-ray photoelectron spectra of graphite-supported copper nanoparticles using informed line shapes
    Fernandez, Vincent
    Kiani, Daniyal
    Fairley, Neal
    Felpin, Francois-Xavier
    Baltrusaitis, Jonas
    APPLIED SURFACE SCIENCE, 2020, 505
  • [10] Anomalous chemical shifts in X-ray photoelectron spectra of sulfur-containing compounds of silver (I) and (II)
    Grzelak, A.
    Jaron, T.
    Mazej, Z.
    Michalowski, T.
    Szarek, P.
    Grochala, W.
    JOURNAL OF ELECTRON SPECTROSCOPY AND RELATED PHENOMENA, 2015, 202 : 38 - 45