Rank-based ant system method for non-linear QSPR analysis: QSPR studies of the solubility parameter

被引:16
作者
Bagheri, M. [2 ,3 ]
Golbraikh, A. [1 ]
机构
[1] Univ N Carolina, Eshelman Sch Pharm, Div Med Chem & Nat Prod, Lab Mol Modeling, Chapel Hill, NC 27515 USA
[2] Univ Tehran, Fac Engn, Dept Chem Engn, Tehran, Iran
[3] Univ British Columbia, Dept Chem & Biol Engn, Vancouver, BC V5Z 1M9, Canada
关键词
computer-aided molecular design (CAMD); quantitative structure-property relationship (QSPR); rank-based ant system feature selection (RBAS-FS); rank-based ant system support vector regression (RBAS-SVR); solubility parameter (delta); STRUCTURE-PROPERTY RELATIONSHIP; K-NEAREST-NEIGHBOR; ORGANIC-COMPOUNDS; AQUEOUS SOLUBILITY; VARIABLE SELECTION; PURE COMPONENTS; PREDICTION; QSAR; VALIDATION; REGRESSION;
D O I
10.1080/1062936X.2011.623356
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The solubility parameter (delta) plays a unique role in the development of stable pharmaceutical formulations for assessing phase segregation during product synthesis. Understanding this parameter helps to determine how a drug substance will behave when processed or when dosed in vivo. The aim of this work was to develop a novel comprehensive yet rapid and accurate Quantitative Structure-Property Relationship (QSPR) method based on the rank-based ant system feature selection. The method was coupled with the multiple linear regression and support vector regression and applied to the assessment of solubility parameters for a diverse dataset of 1804 chemical compounds. The models were validated by solubility prediction of 360 test set compounds which were not used in building models. The developed models have high prediction power characterized by r(2) values 0.75 and 0.82, and RMSE values 1.96 and 1.65 (J/(cm(3)))(0.5) for the external test set. Various validation techniques and comparison results with the novel optimized support vector regression indicate that the developed models can be used to determine the solubility parameters for a diverse set of chemicals with an acceptable accuracy. The developed models can be beneficial for designing new chemical materials with desired solubility parameter values.
引用
收藏
页码:59 / 86
页数:28
相关论文
共 96 条
[1]  
[Anonymous], 2007, HYPERCHEM REL 8 0 WI
[2]  
[Anonymous], 2010, EV PROC DES DAT
[3]  
[Anonymous], MATLAB
[4]  
[Anonymous], 2008, Handbook of molecular descriptors
[5]  
[Anonymous], 1999, Swarm Intelligence
[6]  
[Anonymous], 2007, DRAGON WIND SOFTW MO
[7]  
[Anonymous], 2003, Introduction to Nessus
[8]  
Atkinson Anthony Curtes, 1985, Plots, transformations and regression
[9]  
an introduction to graphical methods of diagnostic regression analysis
[10]  
Bagheri M., 2010, AICHE ANN M SALT LAK