Comparison of representational spaces based on structural information in the development of QSAR models for benzylamino enaminone derivatives

被引:2
作者
Cerruela Garcia, G. [1 ]
Palacios-Bejarano, B. [1 ]
Luque Ruiz, I. [1 ]
Gomez-Nieto, M. A. [1 ]
机构
[1] Univ Cordoba, Dept Comp & Numer Anal, E-14071 Cordoba, Spain
关键词
representational spaces; similarity; approximate similarity; molecular fragments; genetic algorithm; DESCRIPTORS; SIMILARITY; GENERATION; PREDICTION; ALGORITHM;
D O I
10.1080/1062936X.2012.719543
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In this paper we study different representational spaces of molecule data sets based on 2D representation models for the building of QSAR models for the prediction of the activity of 37 benzylamino enaminone derivatives. Approximations based on classical similarity calculated from fingerprint representation of molecules and isomorphism obtained using sub-graph matching algorithms are compared to fragmentation-based approximations using partial least squares and genetic algorithms. The influence of the anchored position of a non-common moiety and the kind of substituents in the common core structure of the data set are analysed, demonstrating the anomalous behaviour of some molecules and therefore the difficulty in building prediction models. These problems are solved by considering approximate similarity models. These models tune the prediction equations based on the size of the substituent and the anchored position, by adjusting the contribution of each substituent in similarity measurements calculated between the molecule data sets.
引用
收藏
页码:751 / 774
页数:24
相关论文
共 45 条
[1]  
[Anonymous], 2002, Principal components analysis
[2]  
[Anonymous], 2010, JCHEM VERS 5 3 7
[3]  
[Anonymous], 2010, v7.10.0 (R2010a)
[4]   Analysis and use of fragment-occurrence data in similarity-based virtual screening [J].
Arif, Shereena M. ;
Holliday, John D. ;
Willett, Peter .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2009, 23 (09) :655-668
[5]   Performance of (consensus) kNN QSAR for predicting estrogenic activity in a large diverse set of organic compounds [J].
Asikainen, AH ;
Ruuskanen, J ;
Tuppurainen, KA .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2004, 15 (01) :19-32
[6]   Fragment-based prediction of cytochromes P450 2D6 and 1A2 inhibition by recursive partitioning [J].
Burton, J. ;
Danloy, E. ;
Vercauteren, D.P. .
SAR and QSAR in Environmental Research, 2009, 20 (3-4) :185-205
[7]   Analysis and Study of Molecule Data Sets Using Snowflake Diagrams of Weighted Maximum Common Subgraph Trees [J].
Cerruela Garcia, Gonzalo ;
Luque Ruiz, Irene ;
Angel Gomez-Nieto, Miguel .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2011, 51 (06) :1216-1232
[8]   New QSAR prediction models derived from GPCR CB2-antagonistic triaryl bis-sulfone analogues by a combined molecular morphological and pharmacophoric approach [J].
Chen, J. -Z. ;
Myint, K. -Z. ;
Xie, X. -Q. .
SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2011, 22 (5-6) :525-544
[9]  
Consonni V, 2010, CHALL ADV COMPUT CHE, V8, P29, DOI 10.1007/978-1-4020-9783-6_3
[10]  
Cousins I.T., 1995, SAR QSAR ENVIRON RES, V3, P279