A rotation-translation invariant molecular descriptor of partial charges and its use in ligand-based virtual screening

被引:23
作者
Berenger, Francois [1 ]
Voet, Arnout [1 ]
Lee, Xiao Yin [1 ]
Zhang, Kam Y. J. [1 ]
机构
[1] RIKEN, Inst Labs, Zhang Initiat Res Unit, Wako, Saitama 3510198, Japan
关键词
RTI molecular descriptor; Partial charges; Ligand-based virtual screening; Spatial auto-correlation; Cross-correlation; Linear binning; ACPC; CORTICOSTEROID-BINDING GLOBULIN; AUTO-CORRELATION DESCRIPTOR; SURFACE-PROPERTIES; FORCE FIELD; SIMILARITY; AUTOCORRELATION; SHAPE; IDENTIFICATION; EQUALIZATION; PERCEPTION;
D O I
10.1186/1758-2946-6-23
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Background: Measures of similarity for chemical molecules have been developed since the dawn of chemoinformatics. Molecular similarity has been measured by a variety of methods including molecular descriptor based similarity, common molecular fragments, graph matching and 3D methods such as shape matching. Similarity measures are widespread in practice and have proven to be useful in drug discovery. Because of our interest in electrostatics and high throughput ligand-based virtual screening, we sought to exploit the information contained in atomic coordinates and partial charges of a molecule. Results: A new molecular descriptor based on partial charges is proposed. It uses the autocorrelation function and linear binning to encode all atoms of a molecule into two rotation-translation invariant vectors. Combined with a scoring function, the descriptor allows to rank-order a database of compounds versus a query molecule. The proposed implementation is called ACPC (AutoCorrelation of Partial Charges) and released in open source. Extensive retrospective ligand-based virtual screening experiments were performed and other methods were compared with in order to validate the method and associated protocol. Conclusions: While it is a simple method, it performed remarkably well in experiments. At an average speed of 1649 molecules per second, it reached an average median area under the curve of 0.81 on 40 different targets; hence validating the proposed protocol and implementation.
引用
收藏
页数:12
相关论文
共 60 条
[1]  
[Anonymous], 2009, MOL DESCRIPTORS CHEM
[2]  
[Anonymous], 1994, J. Comput. Graph. Stat, DOI [DOI 10.1080/10618600.1994.10474656, DOI 10.2307/1390904, 10.1080/10618600.1994.10474656]
[3]  
[Anonymous], 2010, OECHEM 1 7 4
[4]  
[Anonymous], 2007, Numerical Recipes
[5]   The comparison of geometric and electronic properties of molecular surfaces by neural networks: Application to the analysis of corticosteroid-binding globulin activity of steroids [J].
Anzali, S ;
Barnickel, G ;
Krug, M ;
Sadowski, J ;
Wagener, M ;
Gasteiger, J ;
Polanski, J .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 1996, 10 (06) :521-534
[6]   ElectroShape: fast molecular similarity calculations incorporating shape, chirality and electrostatics [J].
Armstrong, M. Stuart ;
Morris, Garrett M. ;
Finn, Paul W. ;
Sharma, Raman ;
Moretti, Loris ;
Cooper, Richard I. ;
Richards, W. Graham .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2010, 24 (09) :789-801
[7]   Further development of reduced graphs for identifying bioactive compounds [J].
Barker, EJ ;
Gardiner, EJ ;
Gillet, VJ ;
Kitts, P ;
Morris, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (02) :346-356
[8]   Locating biologically active compounds in medium-sized heterogeneous datasets by topological autocorrelation vectors: Dopamine and benzodiazepine agonists [J].
Bauknecht, H ;
Zell, A ;
Bayer, H ;
Levi, P ;
Wagener, M ;
Sadowski, J ;
Gasteiger, J .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1996, 36 (06) :1205-1213
[9]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[10]   How Similar Are Similarity Searching Methods? A Principal Component Analysis of Molecular Descriptor Space [J].
Bender, Andreas ;
Jenkins, Jeremy L. ;
Scheiber, Josef ;
Sukuru, Sai Chelan K. ;
Glick, Meir ;
Davies, John W. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2009, 49 (01) :108-119