SAMPL6 logP challenge: machine learning and quantum mechanical approaches

被引:9
作者
Patel, Prajay [1 ]
Kuntz, David M. [2 ,3 ]
Jones, Michael R. [4 ]
Brooks, Bernard R. [4 ]
Wilson, Angela K. [1 ,2 ,3 ]
机构
[1] Michigan State Univ, Dept Chem, E Lansing, MI 48824 USA
[2] Univ North Texas, Dept Chem, Denton, TX 76203 USA
[3] Univ North Texas, Ctr Adv Sci Comp & Modeling CASCaM, Denton, TX 76203 USA
[4] NHLBI, Lab Computat Biol, NIH, Bldg 10, Bethesda, MD 20892 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
SAMPL6; Machine learning; DFT; DLPNO-ccCA; QSAR; Partition coefficient; CONSISTENT COMPOSITE APPROACH; GENERALIZED GRADIENT APPROXIMATION; QUANTITATIVE STRUCTURE-ACTIVITY; WATER PARTITION-COEFFICIENTS; BASIS-SETS; EXCHANGE; ATOMS; DFT; THERMOCHEMISTRY; SOLVATION;
D O I
10.1007/s10822-020-00287-0
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Two different types of approaches: (a) approaches that combine quantitative structure activity relationships, quantum mechanical electronic structure methods, and machine-learning and, (b) electronic structure vertical solvation approaches, were used to predict the logP coefficients of 11 molecules as part of the SAMPL6 logP blind prediction challenge. Using electronic structures optimized with density functional theory (DFT), several molecular descriptors were calculated for each molecule, including van der Waals areas and volumes, HOMO/LUMO energies, dipole moments, polarizabilities, and electrophilic and nucleophilic superdelocalizabilities. A multilinear regression model and a partial least squares model were used to train a set of 97 molecules. As well, descriptors were generated using the molecular operating environment and used to create additional machine learning models. Electronic structure vertical solvation approaches considered include DFT and the domain-based local pair natural orbital methods combined with the solvated variant of the correlation consistent composite approach.
引用
收藏
页码:495 / 510
页数:16
相关论文
共 57 条
  • [1] DFT and ab initio composite methods: Investigation of oxygen fluoride species
    Alsunaidi, Zainab H. A.
    Wilson, Angela K.
    [J]. COMPUTATIONAL AND THEORETICAL CHEMISTRY, 2016, 1095 : 71 - 82
  • [2] [Anonymous], 2019, NUCLEIC ACIDS RES, DOI DOI 10.1093/NAR/GKY1033
  • [3] [Anonymous], 2018, MOL OP ENV MOE
  • [4] Bannan CC, 2018, J COMPUT AID MOL DES, V32, P1165, DOI 10.1007/s10822-018-0169-z
  • [5] DENSITY-FUNCTIONAL THERMOCHEMISTRY .3. THE ROLE OF EXACT EXCHANGE
    BECKE, AD
    [J]. JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (07) : 5648 - 5652
  • [6] DENSITY-FUNCTIONAL EXCHANGE-ENERGY APPROXIMATION WITH CORRECT ASYMPTOTIC-BEHAVIOR
    BECKE, AD
    [J]. PHYSICAL REVIEW A, 1988, 38 (06): : 3098 - 3100
  • [7] Towards the intrinsic error of the correlation consistent Composite Approach (ccCA)
    DeYonker, Nathan J.
    Wilson, Brent R.
    Pierpont, Aaron W.
    Cundari, Thomas R.
    Wilson, Angela K.
    [J]. MOLECULAR PHYSICS, 2009, 107 (8-12) : 1107 - 1121
  • [8] The correlation consistent composite approach (ccCA):: An alternative to the Gaussian-n methods
    DeYonker, NJ
    Cundari, TR
    Wilson, AK
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2006, 124 (11)
  • [9] Gaussian basis sets for use in correlated molecular calculations. X. The atoms aluminum through argon revisited
    Dunning, TH
    Peterson, KA
    Wilson, AK
    [J]. JOURNAL OF CHEMICAL PHYSICS, 2001, 114 (21) : 9244 - 9253
  • [10] Assessment of the Perdew-Burke-Ernzerhof exchange-correlation functional
    Ernzerhof, M
    Scuseria, GE
    [J]. JOURNAL OF CHEMICAL PHYSICS, 1999, 110 (11) : 5029 - 5036