Automatic design of decision-tree induction algorithms tailored to flexible-receptor docking data

被引:21
作者
Barros, Rodrigo C. [1 ]
Winck, Ana T. [2 ]
Machado, Karina S. [3 ]
Basgalupp, Marcio P. [4 ]
de Carvalho, Andre C. P. L. F. [1 ]
Ruiz, Duncan D. [5 ]
de Souza, Osmar Norberto [5 ]
机构
[1] Univ Sao Paulo, Sao Carlos, SP, Brazil
[2] Univ Fed Santa Maria, BR-97119900 Santa Maria, RS, Brazil
[3] Fed Univ Rio Grande, Rio Grande, Brazil
[4] Univ Fed Sao Paulo, Sao Jose Dos Campos, Brazil
[5] Pontificia Univ Catolica Rio Grande do Sul, Porto Alegre, RS, Brazil
基金
巴西圣保罗研究基金会;
关键词
MOLECULAR DOCKING; PROTEIN FUNCTION; DRUG DESIGN; WILD-TYPE; TUBERCULOSIS; PREDICTION; COMPLEX; DISCRETIZATION; FLEXIBILITY; SELECTION;
D O I
10.1186/1471-2105-13-310
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: This paper addresses the prediction of the free energy of binding of a drug candidate with enzyme InhA associated with Mycobacterium tuberculosis. This problem is found within rational drug design, where interactions between drug candidates and target proteins are verified through molecular docking simulations. In this application, it is important not only to correctly predict the free energy of binding, but also to provide a comprehensible model that could be validated by a domain specialist. Decision-tree induction algorithms have been successfully used in drug-design related applications, specially considering that decision trees are simple to understand, interpret, and validate. There are several decision-tree induction algorithms available for general-use, but each one has a bias that makes it more suitable for a particular data distribution. In this article, we propose and investigate the automatic design of decision-tree induction algorithms tailored to particular drug-enzyme binding data sets. We investigate the performance of our new method for evaluating binding conformations of different drug candidates to InhA, and we analyze our findings with respect to decision tree accuracy, comprehensibility, and biological relevance. Results: The empirical analysis indicates that our method is capable of automatically generating decision-tree induction algorithms that significantly outperform the traditional C4.5 algorithm with respect to both accuracy and comprehensibility. In addition, we provide the biological interpretation of the rules generated by our approach, reinforcing the importance of comprehensible predictive models in this particular bioinformatics application. Conclusions: We conclude that automatically designing a decision-tree algorithm tailored to molecular docking data is a promising alternative for the prediction of the free energy from the binding of a drug candidate with a flexible-receptor.
引用
收藏
页数:14
相关论文
共 57 条
[1]   SPENDING ON NEW DRUG DEVELOPMENT [J].
Adams, Christopher Paul ;
Brantner, Van Vu .
HEALTH ECONOMICS, 2010, 19 (02) :130-141
[2]   CNS permeability of drugs predicted by a decision tree [J].
Andres, C ;
Hutter, MC .
QSAR & COMBINATORIAL SCIENCE, 2006, 25 (04) :305-309
[3]  
Barros R. C., 2011, Proceedings of the 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA), P450, DOI 10.1109/ISDA.2011.6121697
[4]  
Barros R.C., 2011, GECCO (Companion Material), P567, DOI DOI 10.1145/2001858.2002050
[5]  
Barros RC, 2010, P 2010 ACM S APPL CO, P1131, DOI DOI 10.1145/1774088.1774327
[6]   A Hyper-Heuristic Evolutionary Algorithm for Automatically Designing Decision-Tree Algorithms [J].
Barros, Rodrigo C. ;
Basgalupp, Marcio P. ;
de Carvalho, Andre C. P. L. F. ;
Freitas, Alex A. .
PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, :1237-1244
[7]   Evolutionary model trees for handling continuous classes in machine learning [J].
Barros, Rodrigo C. ;
Ruiz, Duncan D. ;
Basgalupp, Marcio P. .
INFORMATION SCIENCES, 2011, 181 (05) :954-971
[8]   A Survey of Evolutionary Algorithms for Decision-Tree Induction [J].
Barros, Rodrigo Coelho ;
Basgalupp, Marcio Porto ;
de Carvalho, Andre C. P. L. F. ;
Freitas, Alex A. .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (03) :291-312
[9]   Lexicographic multi-objective evolutionary induction of decision trees [J].
Basgalupp, Marcia P. ;
de Carvalho, Andre C. P. L. F. ;
Barros, Rodrigo C. ;
Ruiz, Duncan D. ;
Freitas, Alex A. .
INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2009, 1 (1-2) :105-117
[10]  
Basgalupp MP., 2009, 2009 ACM SAC, P1085