Predictions of High-Order Electric Properties of Molecules: Can We Benefit from Machine Learning?

被引:11
作者
Tran Tuan-Anh [1 ]
Zalesny, Robert [2 ]
机构
[1] Oxford Univ Clin Res Unit, Wellcome Trust Major Overseas Programme Viet Nam, Ho Chi Minh City, Vietnam
[2] Wroclaw Univ Sci & Technol, Dept Phys & Quantum Chem, Fac Chem, PL-50370 Wroclaw, Poland
来源
ACS OMEGA | 2020年 / 5卷 / 10期
关键词
NONLINEAR-OPTICAL PROPERTIES; BOND-LENGTH ALTERNATION; PUSH-PULL MOLECULES; 2-PHOTON ABSORPTION; 1ST HYPERPOLARIZABILITY; CHANNEL INTERFERENCE; ORGANIC-MOLECULES; EXCITED-STATE; DIPOLE-MOMENT; MODEL;
D O I
10.1021/acsomega.9b04339
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
There is an exigency of adopting machine learning techniques to screen and discover new materials which could address many societal and technological challenges. In this work, we follow this trend and employ machine learning to study (high-order) electric properties of organic compounds. The results of quantum-chemistry calculations of polarizability and first hyperpolarizability, obtained for more than 50,000 compounds, served as targets for machine learning-based predictions. The studied set of molecular structures encompasses organic push-pull molecules with variable linker lengths. Moreover, the diversified set of linkers, composed of alternating single/double and single/triple carbon- carbon bonds, was considered. This study demonstrates that the applied machine learning strategy allows us to obtain the correlation coefficients, between predicted and reference values of (hyper)polarizabilities, exceeding 0.9 on training, validation, and test set. However, in order to achieve such satisfactory predictive power, one needs to choose the training set appropriately, as the machine learning methods are very sensitive to the linker-type diversity in the training set, yielding catastrophic predictions in certain cases. Furthermore, the dependence of (hyper)polarizability on the length of spacers was studied in detail, allowing for explanation of the appreciably high accuracy of employed approaches.
引用
收藏
页码:5318 / 5325
页数:8
相关论文
共 7 条
  • [1] Can Density Functional Theory Be Trusted for High-Order Electric Properties? The Case of Hydrogen-Bonded Complexes
    Zalesny, Robert
    Medved, Miroslav
    Sitkiewicz, Sebastian P.
    Matito, Eduard
    Luis, Josep M.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (06) : 3570 - 3579
  • [2] Forecasting Based on High-Order Fuzzy-Fluctuation Trends and Particle Swarm Optimization Machine Learning
    Jia, Jingyuan
    Zhao, Aiwu
    Guan, Shuang
    SYMMETRY-BASEL, 2017, 9 (07):
  • [3] Can We Train Machine Learning Methods to Outperform the High-dimensional Propensity Score Algorithm?
    Karim, Mohammad Ehsanul
    Pang, Menglan
    Platt, Robert W.
    EPIDEMIOLOGY, 2018, 29 (02) : 191 - 198
  • [4] The dynamic programming high-order Dynamic Bayesian Networks learning for identifying effective connectivity in human brain from fMRI
    Dang, Shilpa
    Chaudhury, Santanu
    Lall, Brejesh
    Roy, Prasun Kumar
    JOURNAL OF NEUROSCIENCE METHODS, 2017, 285 : 33 - 44
  • [5] The Max-Min High-Order Dynamic Bayesian Network Learning for Identifying Gene Regulatory Networks from Time-Series Microarray Data
    Li, Yifeng
    Ngom, Alioune
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2013, : 83 - 90
  • [6] Can lies be faked? Comparing low-stakes and high-stakes deception video datasets from a Machine Learning perspective
    Camara, Mateus Karvat
    Postal, Adriana
    Maul, Tomas Henrique
    Paetzold, Gustavo Henrique
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [7] How Machine Learning and High-Resolution Imagery Can Improve Melt Pond Retrieval From MODIS Over Current Spectral Unmixing Techniques
    Wright, Nicholas C.
    Polashenski, Chris M.
    JOURNAL OF GEOPHYSICAL RESEARCH-OCEANS, 2020, 125 (02)