Simple Idea to Generate Fragment and Pharmacophore Descriptors and Their Implications in Chemical Informatics

被引:6
作者
Catana, Cornel [1 ]
机构
[1] EMD Serono, Drug Discovery Informat, Rockland, MA 02370 USA
关键词
MOLECULAR SIMILARITY;
D O I
10.1021/ci800339p
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Using a well-defined set of fragments/pharmacophores, a new methodology to calculate fragment/pharmacophore descriptors for any molecule onto which at least one fragment/pharmacophore can be mapped is presented. To each fragment/pharmacophore present in a molecule, we attach a descriptor that is calculated by identifying the molecule's atoms onto which it maps and summing over its constituent atomic descriptors. The attached descriptors are named C-fragment/pharmacophore descriptors, and this methodology can be applied to any descriptors defined at the atomic level, such as the partition coefficient, molar refractivity, electrotopological state, etc. By using this methodology, the same fragment/pharmacophore can be shown to have different values in different molecules resulting in better discrimination power. As we know, fragment and pharmacophore fingerprints have a lot of applications in chemical informatics. This study has attempted to find the impact of replacing the traditional value of "1" in a fingerprint with real numbers derived form C-fragment/pharmacophore descriptors. One way to do this is to assess the utility of C-fragment/pharmacophore descriptors in modeling different end points. Here, we exemplify with data from CYP and hERG. The fact that, in many cases, the obtained models were fairly successful and C-fragment descriptors were ranked among the top ones supports the idea that they play an important role in correlation. When we modeled hERG with C-pharmacophore descriptors, however, the model performances decreased slightly, and we attribute this, mainly to the fact that there is no technique capable of handling multiple instances (states). We hope this will open new research, especially in the emerging field of machine learning. Further research is needed to see the impact of C-fragment/pharmacophore descriptors in similarity/dissimilarity applications.
引用
收藏
页码:543 / 548
页数:6
相关论文
共 19 条
[1]  
*ACC, 2008, PIP PIL VERS 6 0
[2]  
[Anonymous], 2008, MOL OP ENV MOE VERS
[3]   SUBSTRUCTURE SEARCHING METHODS - OLD AND NEW [J].
BARNARD, JM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1993, 33 (04) :532-538
[4]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[5]   Molecular similarity searching using atom environments, information-based feature selection, and a naive Bayesian classifier [J].
Bender, A ;
Mussa, HY ;
Glen, RC ;
Reiling, S .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :170-178
[6]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]   MOLECULAR-STRUCTURE COMPARISON PROGRAM FOR IDENTIFICATION OF MAXIMAL COMMON SUBSTRUCTURES [J].
CONE, MM ;
VENKATARAGHAVAN, R ;
MCLAFFERTY, FW .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1977, 99 (23) :7668-7671
[9]   The signature molecular descriptor. 1. Using extended valence sequences in QSAR and QSPR studies [J].
Faulon, JL ;
Visco, DP ;
Pophale, RS .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (03) :707-720
[10]   STOCHASTIC GENERATOR OF CHEMICAL-STRUCTURE .1. APPLICATION TO THE STRUCTURE ELUCIDATION OF LARGE MOLECULES [J].
FAULON, JL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (05) :1204-1218