Structure-Based Drug Screening and Ligand-Based Drug Screening with Machine Learning

Cited by: 34
Author
Fukunishi, Yoshifumi [1, 2]
Affiliations
[1] Natl Inst Adv Ind Sci & Technol, BIRC, Koto Ku, Tokyo 1350064, Japan
[2] BioGrid Ctr Kansai, Osaka 5600082, Japan
Keywords
Virtual screening; affinity fingerprint; machine learning; neural network model; support vector machine; decision tree; Bayesian model; self-organizing map; HIGH-THROUGHPUT DOCKING; SUPPORT VECTOR MACHINE; PROTEIN-COUPLED RECEPTORS; SCORING FUNCTION; NEURAL-NETWORKS; MOLECULAR-SURFACES; BINDING; INHIBITORS; PREDICTION; IDENTIFICATION;
DOI
10.2174/138620709788167890
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification code
071010; 081704
Abstract
The initial stage of drug development is the hit (active) compound search from a pool of millions of compounds; for this process, in silico (virtual) screening has been successfully applied. One of the problems of in silico screening, however, is the low hit ratio in relation to the high computational cost and the long CPU time. This problem becomes serious in structure-based in silico screening. The major reason is the low accuracy of the estimation of protein-compound binding free energy. The problem of ligand-based in silico screening is that the conventional quantitative structure-activity relationship (QSAR) approach is not effective at predicting new hit compounds with new scaffolds. Recently, machine-learning approaches have been applied to in silico drug screening to overcome the above problems. We review here machine-learning approaches for both structure-based and ligand-based drug screening. Machine learning is used to improve database enrichment in two ways, namely by improving the docking score calculated by the protein-compound docking program and by calculating the optimal distance between the feature vectors of active and inactive compounds. Both approaches require compounds that are known to be active with respect to the target protein. In structure-based screening, the former approach is mainly used with a protein-compound affinity matrix. In ligand-based screening, both the former and latter approaches are used, and the latter approach can be applied to various kinds of descriptors, such as 1D/2D descriptors/fingerprints and the affinity fingerprint given by the protein-compound affinity matrix.
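The affinity-fingerprint idea mentioned in the abstract can be illustrated with a minimal sketch. It assumes a small protein-compound affinity matrix of docking scores (rows are proteins, columns are compounds) and ranks database compounds by Euclidean distance to the nearest known active. The function names, the toy scores, the per-protein z-scoring, and the nearest-active distance are all illustrative assumptions, not the specific methods covered in the review.

```python
import numpy as np


def affinity_fingerprints(scores):
    """Turn a (n_proteins, n_compounds) score matrix into per-compound fingerprints.

    Assumption: each protein row is z-scored so that no single protein
    dominates the distance. Returns an array of shape (n_compounds, n_proteins).
    """
    scores = np.asarray(scores, dtype=float)
    mean = scores.mean(axis=1, keepdims=True)
    std = scores.std(axis=1, keepdims=True)
    std[std == 0.0] = 1.0  # avoid division by zero for constant rows
    return ((scores - mean) / std).T


def rank_by_similarity_to_actives(scores, active_idx):
    """Rank non-active compounds by distance to their nearest known active."""
    fps = affinity_fingerprints(scores)
    actives = fps[list(active_idx)]
    # Pairwise differences via broadcasting: (n_compounds, n_actives, n_proteins)
    diffs = fps[:, None, :] - actives[None, :, :]
    nearest = np.sqrt((diffs ** 2).sum(axis=2)).min(axis=1)
    known = set(active_idx)
    order = np.argsort(nearest, kind="stable")
    return [int(i) for i in order if int(i) not in known]


# Hypothetical docking scores (lower = stronger predicted binding).
toy_scores = [
    [-9.0, -8.5, -3.0, -4.0, -2.0],  # protein A
    [-2.0, -2.5, -8.0, -7.0, -9.0],  # protein B
    [-5.0, -5.2, -5.1, -1.0, -9.0],  # protein C
]

# Compound 0 is treated as the only known active.
ranked = rank_by_similarity_to_actives(toy_scores, active_idx=[0])
print(ranked)
```

In this toy matrix, compound 1 has nearly the same binding profile across the three proteins as the known active, so it ranks first. In practice the choice of distance and normalization is exactly what a machine-learning step would tune using known actives and inactives.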
Pages: 397-408
Page count: 12