In Silico Prediction of Aqueous Solubility: A Multimodel Protocol Based on Chemical Similarity

被引:29
作者
Chevillard, Florent [1 ,2 ]
Lagorce, David [1 ]
Reynes, Christelle [1 ,3 ]
Villoutreix, Bruno O. [1 ]
Vayer, Philippe [4 ]
Miteva, Maria A. [1 ]
机构
[1] Univ Paris Diderot, Inserm UMR S 973, F-75013 Paris, France
[2] Univ Marburg, Inst Pharmaceut Chem, D-35037 Marburg, Germany
[3] Univ Montpellier I, UFR Pharm, Lab Phys Ind & Traitement Informat EA 2415, F-34093 Montpellier 5, France
[4] Technol Servier, BioInformat Modelling Dept, F-45007 Orleans 1, France
关键词
solubility prediction; chemical structure similarity; QSPR models; multimodel optimization; ORGANIC-MOLECULES; DRUG SOLUBILITY; CHALLENGE; DISCOVERY; SELECTION; SET;
D O I
10.1021/mp300234q
中图分类号
R-3 [医学研究方法]; R3 [基础医学];
学科分类号
1001 ;
摘要
Aqueous solubility is one of the most important ADMET properties to assess and to optimize during the drug discovery process. At present, accurate prediction of solubility remains very challenging and there is an important need of independent benchmarking of the existing in silico models such as to suggest solutions for their improvement. In this study, we developed a new protocol for improved solubility prediction by combining several existing models available in commercial or free software packages. We first performed an evaluation of ten in silico models for aqueous solubility prediction on several data sets in order to assess the reliability of the methods, and we proposed a new diverse data set of 150 molecules as relevant test set, SolDiv150. We developed a random forest protocol to evaluate the performance of different fingerprints for aqueous solubility prediction based on molecular structure similarity. Our protocol, called a "multimodel protocol", allows selecting the most accurate model for a compound of interest among the employed models or software packages, achieving r(2) of 0.84 when applied to SolDiv150. We also found that all models assessed here performed better on druglike molecules than on real drugs, thus additional improvement is needed in this direction. Overall, our approach enlarges the applicability domain as demonstrated by the more accurate results for solubility prediction obtained using our protocol in comparison to using individual models.
引用
收藏
页码:3127 / 3135
页数:9
相关论文
共 42 条
[1]   Experimental solubility profiling of marketed CNS drugs, exploring solubility limit of CNS discovery candidate [J].
Alelyunas, Yun W. ;
Empfield, James R. ;
McCarthy, Dennis ;
Spreen, Russell C. ;
Bui, Khanh ;
Pelosi-Kilby, Luciana ;
Shen, Cindy .
BIOORGANIC & MEDICINAL CHEMISTRY LETTERS, 2010, 20 (24) :7312-7316
[2]   Revisiting the General Solubility Equation: In Silico Prediction of Aqueous Solubility Incorporating the Effect of Topographical Polar Surface Area [J].
Ali, Jogoth ;
Camilleri, Patrick ;
Brown, Marc B. ;
Hutt, Andrew J. ;
Kirton, Stewart B. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2012, 52 (02) :420-428
[3]  
[Anonymous], 2005, R LANG ENV STAT COMP
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]  
Butina D, 2003, J CHEM INF COMP SCI, V43, P837, DOI 10.1021/6020279y
[6]   Prediction of aqueous solubility of a diverse set of compounds using quantitative structure-property relationships [J].
Cheng, AL ;
Merz, KM .
JOURNAL OF MEDICINAL CHEMISTRY, 2003, 46 (17) :3572-3580
[7]   VolSurf: a new tool for the pharmacokinetic optimization of lead compounds [J].
Cruciani, G ;
Pastor, M ;
Guba, W .
EUROPEAN JOURNAL OF PHARMACEUTICAL SCIENCES, 2000, 11 :S29-S39
[8]   ESOL: Estimating aqueous solubility directly from molecular structure [J].
Delaney, JS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (03) :1000-1005
[9]   Predicting aqueous solubility from structure [J].
Delaney, JS .
DRUG DISCOVERY TODAY, 2005, 10 (04) :289-295
[10]   Biological assay challenges from compound solubility: strategies for bioassay optimization [J].
Di, L ;
Kerns, EH .
DRUG DISCOVERY TODAY, 2006, 11 (9-10) :446-451