Ligand-based virtual screening by novelty detection with self-organizing maps

被引:22
作者
Hristozov, Dimitar
Oprea, Tudor I.
Gasteiger, Johann
机构
[1] Univ Erlangen Nurnberg, Centrum Comp Chem, D-91052 Erlangen, Germany
[2] Univ New Mexico, Sch Med, Div Biocomp, Albuquerque, NM 87131 USA
关键词
D O I
10.1021/ci700040r
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
We describe a novel method for ligand-based virtual screening, based on utilizing Self-Organizing Maps (SOM) as a novelty detection device. Novelty detection (or one-class classification) refers to the attempt of identifying patterns that do not belong to the space covered by a given data set. In ligand-based virtual screening, chemical structures perceived as novel lie outside the known activity space and can therefore be discarded from further investigation. In this context, the concept of "novel structure" refers to a compound, which is unlikely to share the activity of the query structures. Compounds not perceived as "novel" are suspected to share the activity of the query structures. Nowadays, various databases contain active structures but access to compounds which have been found to be inactive in a biological assay is limited. This work addresses this problem via novelty detection, which does not require proven inactive compounds. The structures are described by spatial autocorrelation functions weighted by atomic physicochemical properties. Different methods for selecting a subset of targets from a larger set are discussed. A comparison with similarity search based on Daylight fingerprints followed by data fusion is presented. The two methods complement each other to a large extent. In a retrospective screening of the WOMBAT database novelty detection with SOM gave enrichment factors between 105 and 462-an improvement over the similarity search based on Daylight fingerprints between 25% and 100%,. when the 100 top ranked structures were considered. Novelty detection with SOM is applicable (1) to improve the retrieval of potentially active compounds also in concert with other virtual screening methods; (2) as a library design tool for discarding a large number of compounds, which are unlikely to possess a given biological activity; and (3) for selecting a small number of potentially active compounds from a large data set.
引用
收藏
页码:2044 / 2062
页数:19
相关论文
共 49 条
[1]  
*ADRIANA, 2006, COD VERS 1 0
[2]  
[Anonymous], 2003, P WORKSH SELF ORG MA
[3]   Locating biologically active compounds in medium-sized heterogeneous datasets by topological autocorrelation vectors: Dopamine and benzodiazepine agonists [J].
Bauknecht, H ;
Zell, A ;
Bayer, H ;
Levi, P ;
Wagener, M ;
Sadowski, J ;
Gasteiger, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (06) :1205-1213
[4]   Bayes affinity fingerprints improve retrieval rates in virtual screening and define orthogonal bioactivity space: When are multitarget drugs a feasible concept? [J].
Bender, Andreas ;
Jenkins, Jeremy L. ;
Glick, Meir ;
Deng, Zhan ;
Nettles, James H. ;
Davies, John W. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) :2445-2456
[5]   Unsupervised data base clustering based on Daylight's fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets [J].
Butina, D .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04) :747-750
[6]   The Mahalanobis distance [J].
De Maesschalck, R ;
Jouan-Rimbaud, D ;
Massart, DL .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2000, 50 (01) :1-18
[7]   Effectiveness of retrieval in similarity searches of chemical databases: A review of performance measures [J].
Edgar, SJ ;
Holliday, JD ;
Willett, P .
JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2000, 18 (4-5) :343-357
[8]   Virtual screening of biogenic amine-binding G-protein coupled receptors: Comparative evaluation of protein- and ligand-based virtual screening protocols [J].
Evers, A ;
Hessler, G ;
Matter, H ;
Klabunde, T .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (17) :5448-5465
[9]   Comparison of correlation vector methods for ligand-based similarity searching [J].
Fechner, U ;
Franke, L ;
Renner, S ;
Schneider, P ;
Schneider, G .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2003, 17 (10) :687-698
[10]   ITERATIVE PARTIAL EQUALIZATION OF ORBITAL ELECTRONEGATIVITY - A RAPID ACCESS TO ATOMIC CHARGES [J].
GASTEIGER, J ;
MARSILI, M .
TETRAHEDRON, 1980, 36 (22) :3219-3228