Predicting Solvent-Dependent Nucleophilicity Parameter with a Causal Structure Property Relationship

被引:21
作者
Boobier, Samuel [1 ]
Liu, Yufeng [1 ]
Sharma, Krishna [1 ]
Hose, David R. J. [2 ]
Blacker, A. John [1 ]
Kapur, Nikil [3 ]
Nguyen, Bao N. [1 ]
机构
[1] Univ Leeds, Sch Chem, Inst Proc Res & Dev, Leeds LS2 9JT, W Yorkshire, England
[2] AstraZeneca, Chem Dev Pharmaceut Technol & Dev, Operat, Macclesfield SK10 2NA, Cheshire, England
[3] Univ Leeds, Sch Mech Engn, Leeds LS2 9JT, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
FUKUI FUNCTIONS; REACTIVITY; STEREOSELECTIVITY; FRAMEWORK; SYSTEM;
D O I
10.1021/acs.jcim.1c00610
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Solvent-dependent reactivity is a key aspect of synthetic science, which controls reaction selectivity. The contemporary focus on new, sustainable solvents highlights a need for reactivity predictions in different solvents. Herein, we report the excellent machine learning prediction of the nucleophilicity parameter N in the four most-common solvents for nucleophiles in the Mayr's reactivity parameter database (R-2 = 0.93 and 81.6% of predictions within +/- 2.0 of the experimental values with Extra Trees algorithm). A Causal Structure Property Relationship (CSPR) approach was utilized, with focus on the physicochemical relationships between the descriptors and the predicted parameters, and on rational improvements of the prediction models. The nucleophiles were represented with a series of electronic and steric descriptors and the solvents were represented with principal component analysis (PCA) descriptors based on the ACS Solvent Tool. The models indicated that steric factors do not contribute significantly, because of bias in the experimental database. The most important descriptors are solvent-dependent HOMO energy and Hirshfeld charge of the nucleophilic atom. Replacing DFT descriptors with Parameterization Method 6 (PM6) descriptors for the nucleophiles led to an 8.7-fold decrease in computational time, and an similar to 10% decrease in the percentage of predictions within +/- 2.0 and +/- 1.0 of the experimental values.
引用
收藏
页码:4890 / 4899
页数:10
相关论文
共 40 条
[1]   ORGANOSILICON COMPOUNDS .24. AN INVESTIGATION OF REPORTED CLEAVAGES OF ARYL-SILICON BONDS BY ALUMINIUM CHLORIDE [J].
AUSTIN, JD ;
SMITH, JD ;
EABORN, C .
JOURNAL OF THE CHEMICAL SOCIETY, 1963, (OCT) :4744-&
[2]   Origins of opposite absolute stereloselectivities in proline-catalyzed direct Mannich and aldol reactions [J].
Bahmanyar, S ;
Houk, KN .
ORGANIC LETTERS, 2003, 5 (08) :1249-1251
[3]   Machine learning with physicochemical relationships: solubility prediction in organic solvents and water [J].
Boobier, Samuel ;
Hose, David R. J. ;
Blacker, A. John ;
Nguyen, Bao N. .
NATURE COMMUNICATIONS, 2020, 11 (01)
[4]   Hybrid QSPR models for the prediction of the free energy of solvation of organic solute/solvent pairs [J].
Borhani, Tohid N. ;
Garcia-Munoz, Salvador ;
Luciani, Carla Vanesa ;
Galindo, Amparo ;
Adjiman, Claire S. .
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2019, 21 (25) :13706-13720
[5]   Seeking for Selectivity and Efficiency: New Approaches in the Synthesis of Raltegravir [J].
Caputo, Francesco ;
Corbetta, Stefano ;
Piccolo, Oreste ;
Vigo, Daniele .
ORGANIC PROCESS RESEARCH & DEVELOPMENT, 2020, 24 (06) :1149-1156
[6]   DFT analysis of the nucleophilicity of substituted pyridines and prediction of new molecules having nucleophilic character stronger than 4-pyrrolidino pyridine [J].
Deka, Kaustavmoni ;
Phukan, Prodeep .
JOURNAL OF CHEMICAL SCIENCES, 2016, 128 (04) :633-647
[7]   Toward a More Holistic Framework for Solvent Selection [J].
Diorazio, Louis J. ;
Hose, David R. J. ;
Adlington, Neil K. .
ORGANIC PROCESS RESEARCH & DEVELOPMENT, 2016, 20 (04) :760-773
[8]  
Famini GR, 1999, J PHYS ORG CHEM, V12, P645
[9]  
Frisch M.J., 2016, GAUSSIAN 16 REVISION, DOI DOI 10.1021/ACS.ORGLETT.4C03656/SUPPLFILE/OL4C03656SI001.PDF
[10]   Using Machine Learning To Predict Suitable Conditions for Organic Reactions [J].
Gao, Hanyu ;
Struble, Thomas J. ;
Coley, Connor W. ;
Wang, Yuran ;
Green, William H. ;
Jensen, Klavs F. .
ACS CENTRAL SCIENCE, 2018, 4 (11) :1465-1476