Estimation of ADME Properties with Substructure Pattern Recognition

被引:272
作者
Shen, Jie [1 ]
Cheng, Feixiong [1 ]
Xu, You [1 ]
Li, Weihua [1 ]
Tang, Yun [1 ]
机构
[1] E China Univ Sci & Technol, Sch Pharm, Dept Pharmaceut Sci, Shanghai 200237, Peoples R China
关键词
BLOOD-BRAIN-BARRIER; IN-SILICO PREDICTION; DRUG DISCOVERY; PERMEABILITY; DESCRIPTORS; CLASSIFICATION; PENETRATION; ABSORPTION; ACCURACY; MODELS;
D O I
10.1021/ci100104j
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Over the past decade, absorption, distribution, metabolism, and excretion (ADME) property evaluation has become one of the most important issues in the process of drug discovery and development. Since in vivo and in vitro evaluations are costly and laborious, in silico techniques had been widely used to estimate ADME properties of chemical compounds. Traditional prediction methods usually try to build a functional relationship between a set of molecular descriptors and a given ADME property. Although traditional methods have been successfully used in many cases, the accuracy and efficiency of molecular descriptors must be concerned. Herein, we report a new classification method based on substructure pattern recognition, in which each molecule is represented as a substructure pattern fingerprint based on a predefined substructure dictionary, and then a support vector machine (SVM) algorithm is applied to build the prediction model. Therefore, a direct connection between substructures and molecular properties is built. The most important substructure patterns can be identified via the information gain analysis, which could help to interpret the models from a medicinal chemistry perspective. Afterward, this method was verified with two data sets, one for blood-brain barrier (BBB) penetration and the other for human intestinal absorption (HIA). The results demonstrated that the overall predictive accuracies of the best HIA model for the training and test sets were 98.5 and 98.8%, and the overall predictive accuracies of the best BBB model for the training and test sets were 98.8 and 98.4%, which confirmed the reliability of our method. In the additional validations, the predictive accuracies were 94 and 69.5% for the HIA and the BBB models, respectively. Moreover, some of the representative key substructure patterns which significantly correlated with the HIA and BBB penetration properties were also presented.
引用
收藏
页码:1034 / 1041
页数:8
相关论文
共 32 条
[1]   Blood-brain barrier permeation models: Discriminating between potential CNS and non-CNS drugs including P-glycoprotein substrates [J].
Adenot, M ;
Lahana, R .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (01) :239-248
[2]  
[Anonymous], HDB RES MACHINE LEAR
[3]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[4]   Emerging chemical patterns: A new methodology for molecular classification and compound selection [J].
Auer, Jens ;
Bajorath, Juergen .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (06) :2502-2514
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]  
Daylight Chemical Information Systems I, FING SCREEN SIM
[7]   Reoptimization of MDL keys for use in drug discovery [J].
Durant, JL ;
Leland, BA ;
Henry, DR ;
Nourse, JG .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (06) :1273-1280
[8]  
*FREE SOFTW FND IN, OP BAB
[9]   Kernel functions for attributed molecular graphs -: A new similarity-based approach to ADME prediction in classification and regression [J].
Fröhlich, H ;
Wegner, JK ;
Sieker, F ;
Zell, A .
QSAR & COMBINATORIAL SCIENCE, 2006, 25 (04) :317-326
[10]   In silico prediction of blood brain barrier permeability: An artificial neural network model [J].
Garg, P ;
Verma, J .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (01) :289-297