Benchmarking and validating algorithms that estimate pKa values of drugs based on their molecular structures

被引:82
作者
Meloun, Milan [1 ]
Bordovska, Sylva [1 ]
机构
[1] Univ Pardubice, Fac Chem Technol, Dept Analyt Chem, Pardubice 53210, Czech Republic
关键词
pK(a) prediction; pK(a) accuracy; dissociation constants; outliers; influential points; residuals; goodness-of-fit; Williams graph;
D O I
10.1007/s00216-007-1502-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The REGDIA regression diagnostics algorithm in S-Plus is introduced in order to examine the accuracy of pK(a) predictions made with four updated programs: PAL-LAS, MARVIN, ACD/pK(a) and SPARC. This report reviews the current status of computational tools for predicting the pK(a) values of organic drug-like compounds. Outlier predicted pK(a) values correspond to molecules that are poorly characterized by the pKa prediction program concerned. The statistical detection of outliers can fail because of masking and swamping effects. The Williams graph was selected to give the most reliable detection of outliers. Six statistical characteristics (F-exp, R-2, R-P(2), MEP, AIC, and s(e) in pK(a) units) of the results obtained when four selected pKa prediction algorithms were applied to three datasets were examined. The highest values of F-exp, R-2, R-P(2), the lowest values of MEP and s(e), and the most negative AIC were found using the ACD/pK(a) algorithm for pK(a) prediction, so this algorithm achieves the best predictive power and the most accurate results. The proposed accuracy test performed by the REGDIA program can also be applied to test the accuracy of other predicted values, such as log P, log D, aqueous solubility or certain physicochemical properties of drug molecules.
引用
收藏
页码:1267 / 1281
页数:15
相关论文
共 31 条
[1]  
*ACD LABS, 2007, PKA PRED 3 0
[2]  
*ACD LABS, 1997, RES TITR MEAS SEL DR
[3]  
*ACD LABS, 2007, ACD PKA DB VS EXPT C
[4]   High-throughput, in silico prediction of aqueous solubility based on one- and two-dimensional descriptors [J].
Engkvist, O ;
Wrede, P .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (05) :1247-1249
[5]   Application of multivariate data analysis methods to Comparative Molecular Field Analysis (CoMFA) data: Proton affinities and pKa prediction for nucleic acids components [J].
Gargallo, R ;
Sotriffer, CA ;
Liedl, KR ;
Rode, BM .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1999, 13 (06) :611-623
[6]  
GULYAS Z, 2007, PALLAS CLUSTER NEW S
[7]  
Habibi-Yangjeh A, 2005, B KOREAN CHEM SOC, V26, P2007
[8]   Prediction of pH-dependent aqueous solubility of druglike molecules [J].
Hansen, Niclas Tue ;
Kouskoumvekaki, Irene ;
Jorgensen, Flemming Steen ;
Brunak, Soren ;
Jonsdottir, Svava Osk .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) :2601-2609
[9]  
Hilal SH, 2003, EPA600R03030
[10]   DIRECT PREDICTION OF DISSOCIATION-CONSTANTS (PKAS) OF CLONIDINE-LIKE IMIDAZOLINES, 2-SUBSTITUTED IMIDAZOLES, AND 1-METHYL-2-SUBSTITUTED-IMIDAZOLES FROM 3D STRUCTURES USING A COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) APPROACH [J].
KIM, KH ;
MARTIN, YC .
JOURNAL OF MEDICINAL CHEMISTRY, 1991, 34 (07) :2056-2060