MRlogP: Transfer Learning Enables Accurate logP Prediction Using Small Experimental Training Datasets

被引:4
作者
Chen, Yan-Kai [1 ]
Shave, Steven [1 ]
Auer, Manfred [1 ]
机构
[1] Univ Edinburgh, Sch Biol Sci, Kings Bldg, Edinburgh EH9 3BF, Midlothian, Scotland
基金
英国医学研究理事会; 英国惠康基金;
关键词
lipophilicity prediction; logP prediction; transfer learning; physicochemical property prediction; WATER PARTITION-COEFFICIENTS; PHYSICOCHEMICAL PARAMETERS; DRUG-LIKE; LIPOPHILICITY; SUBSTRUCTURE;
D O I
10.3390/pr9112029
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Small molecule lipophilicity is often included in generalized rules for medicinal chemistry. These rules aim to reduce time, effort, costs, and attrition rates in drug discovery, allowing the rejection or prioritization of compounds without the need for synthesis and testing. The availability of high quality, abundant training data for machine learning methods can be a major limiting factor in building effective property predictors. We utilize transfer learning techniques to get around this problem, first learning on a large amount of low accuracy predicted logP values before finally tuning our model using a small, accurate dataset of 244 druglike compounds to create MRlogP, a neural network-based predictor of logP capable of outperforming state of the art freely available logP prediction methods for druglike small molecules. MRlogP achieves an average root mean squared error of 0.988 and 0.715 against druglike molecules from Reaxys and PHYSPROP. We have made the trained neural network predictor and all associated code for descriptor generation freely available. In addition, MRlogP may be used online via a web interface.
引用
收藏
页数:9
相关论文
共 33 条
[1]   New Substructure Filters for Removal of Pan Assay Interference Compounds (PAINS) from Screening Libraries and for Their Exclusion in Bioassays [J].
Baell, Jonathan B. ;
Holloway, Georgina A. .
JOURNAL OF MEDICINAL CHEMISTRY, 2010, 53 (07) :2719-2740
[2]  
Bickerton GR, 2012, NAT CHEM, V4, P90, DOI [10.1038/NCHEM.1243, 10.1038/nchem.1243]
[3]   Computation of octanol-water partition coefficients by guiding an additive model with knowledge [J].
Cheng, Tiejun ;
Zhao, Yuan ;
Li, Xun ;
Lin, Fu ;
Xu, Yong ;
Zhang, Xinglong ;
Li, Yan ;
Wang, Renxiao ;
Lai, Luhua .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (06) :2140-2148
[4]   Freely Available Conformer Generation Methods: How Good Are They? [J].
Ebejer, Jean-Paul ;
Morris, Garrett M. ;
Deane, Charlotte M. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (05) :1146-1158
[5]   ATOMIC PHYSICOCHEMICAL PARAMETERS FOR 3-DIMENSIONAL STRUCTURE DIRECTED QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS .3. MODELING HYDROPHOBIC INTERACTIONS [J].
GHOSE, AK ;
PRITCHETT, A ;
CRIPPEN, GM .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1988, 9 (01) :80-90
[6]   Predicting the equilibrium partitioning of organic compounds using just one linear solvation energy relationship (LSER) [J].
Goss, KU .
FLUID PHASE EQUILIBRIA, 2005, 233 (01) :19-22
[7]   Finding the sweet spot: the role of nature and nurture in medicinal chemistry [J].
Hann, Michael M. ;
Keserue, Gyoergy M. .
NATURE REVIEWS DRUG DISCOVERY, 2012, 11 (05) :355-365
[8]  
Lawson AJ, 2014, ACS SYM SER, V1164, P127
[9]   Drug-like properties and the causes of poor solubility and poor permeability [J].
Lipinski, CA .
JOURNAL OF PHARMACOLOGICAL AND TOXICOLOGICAL METHODS, 2000, 44 (01) :235-249
[10]  
Lipinski Christopher A, 2004, Drug Discov Today Technol, V1, P337, DOI 10.1016/j.ddtec.2004.11.007