Structure-Based Drug Screening and Ligand-Based Drug Screening with Machine Learning

被引:34
作者
Fukunishi, Yoshifumi [1 ,2 ]
机构
[1] Natl Inst Adv Ind Sci & Technol, BIRC, Koto Ku, Tokyo 1350064, Japan
[2] BioGrid Ctr Kansai, Osaka 5600082, Japan
关键词
Virtual screening; affinity fingerprint; machine learning; neural network model; support vector machine; decision tree; Bayesian model; self-organizing map; HIGH-THROUGHPUT DOCKING; SUPPORT VECTOR MACHINE; PROTEIN-COUPLED RECEPTORS; SCORING FUNCTION; NEURAL-NETWORKS; MOLECULAR-SURFACES; BINDING; INHIBITORS; PREDICTION; IDENTIFICATION;
D O I
10.2174/138620709788167890
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The initial stage of drug development is the hit (active) compound search from a pool of millions of compounds; for this process, in silico (virtual) screening has been successfully applied. One of the problems of in silico screening, however, is the low hit ratio in relation to the high computational cost and the long CPU time. This problem becomes serious in structure-based in silico screening. The major reason is the low accuracy of the estimation of protein-compound binding free energy. The problem of ligand-based in silico screening is that the conventional quantitative structure-activity relationship (QSAR) approach is not effective at predicting new hit compounds with new scaffolds. Recently, machine-learning approaches have been applied to in silico drug screening to overcome the above problems. We review here machine-learning approaches for both structure-based and ligand-based drug screening. Machine learning is used to improve database enrichment in two ways, namely by improving the docking score calculated by the protein-compound docking program and by calculating the optimal distance between the feature vectors of active and inactive compounds. Both approaches require compounds that are known to be active with respect to the target protein. In structure-based screening, the former approach is mainly used with a protein-compound affinity matrix. In ligand-based screening, both the former and latter approaches are used, and the latter approach can be applied to various kinds of descriptors, such as 1D/2D descriptors/fingerprints and the affinity fingerprint given by the protein-compound affinity matrix.
引用
收藏
页码:397 / 408
页数:12
相关论文
共 79 条
[11]   In vitro and in silico affinity fingerprints:: Finding similarities beyond structural classes [J].
Briem, H ;
Lessel, UF .
PERSPECTIVES IN DRUG DISCOVERY AND DESIGN, 2000, 20 (01) :231-244
[12]  
BURKARD U, 2005, CHEMOINFOMATICS A, P435
[13]   Comparison of support vector machine and artificial neural network systems for drug/nondrug classification [J].
Byvatov, E ;
Fechner, U ;
Sadowski, J ;
Schneider, G .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (06) :1882-1889
[14]   Structure-based identification of binding sites, native ligands and potential inhibitors for G-protein coupled receptors [J].
Cavasotto, CN ;
Orry, AJW ;
Abagyan, RA .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 51 (03) :423-433
[15]   STRUCTURE-BASED DRUG DESIGN [J].
COLMAN, PM .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1994, 4 (06) :868-874
[16]   Virtual screening to enrich a compound collection with CDK2 inhibitors using docking, scoring, and composite scoring models [J].
Cotesta, S ;
Giordanetto, F ;
Trosset, JY ;
Crivori, P ;
Kroemer, RT ;
Stouten, PFW ;
Vulpetti, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 60 (04) :629-643
[17]   COMPARATIVE MOLECULAR-FIELD ANALYSIS (COMFA) .1. EFFECT OF SHAPE ON BINDING OF STEROIDS TO CARRIER PROTEINS [J].
CRAMER, RD ;
PATTERSON, DE ;
BUNCE, JD .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1988, 110 (18) :5959-5967
[18]   Predicting CNS permeability of drug molecules: comparison of neural network and support vector machine algorithms [J].
Doniger, S ;
Hofmann, T ;
Yeh, J .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (06) :849-864
[19]   Virtual screening of biogenic amine-binding G-protein coupled receptors: Comparative evaluation of protein- and ligand-based virtual screening protocols [J].
Evers, A ;
Hessler, G ;
Matter, H ;
Klabunde, T .
JOURNAL OF MEDICINAL CHEMISTRY, 2005, 48 (17) :5448-5465
[20]   Classification of chemical compounds by protein-compound docking for use in designing a focused library [J].
Fukunishi, Y ;
Mikami, Y ;
Takedomi, K ;
Yamanouchi, M ;
Shima, H ;
Nakamura, H .
JOURNAL OF MEDICINAL CHEMISTRY, 2006, 49 (02) :523-533