Multi-class Mode of Action Classification of Toxic Compounds Using Logic Based Kernel Methods

被引:2
作者
Lodhi, Huma [1 ]
Muggleton, Stephen [2 ]
Sternberg, Mike J. E. [3 ]
机构
[1] Brunel Univ, Sch Informat Syst Comp & Math, Uxbridge UB8 3PH, Middx, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
[3] Univ London Imperial Coll Sci Technol & Med, Div Mol Biosci, Struct Bioinformat Grp, London SW7 2AZ, England
基金
英国生物技术与生命科学研究理事会;
关键词
Predictive toxicology; Support vector machines; Inductive logic programming; Multi-class classification; Molecular modelling; Drug design; GRAPH KERNELS; MACHINE; TOOL;
D O I
10.1002/minf.201000083
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Toxicity prediction is essential for drug design and development of effective therapeutics. In this paper we present an in silico strategy, to identify the mode of action of toxic compounds, that is based on the use of a novel logic based kernel method. The technique uses support vector machines in conjunction with the kernels constructed from first order rules induced by an Inductive Logic Programming system. It constructs multi-class models by using a divide and conquer reduction strategy that splits multi-classes into binary groups and solves each individual problem recursively hence generating an underlying decision list structure. In order to evaluate the effectiveness of the approach for chemoinformatics problems like predictive toxicology, we apply it to toxicity classification in aquatic systems. The method is used to identify and classify 442 compounds with respect to the mode of action. The experimental results show that the technique successfully classifies toxic compounds and can be useful in assessing environmental risks. Experimental comparison of the performance of the proposed multi-class scheme with the standard multi-class Inductive Logic Programming algorithm and multi-class Support Vector Machine yields statistically significant results and demonstrates the potential power and benefits of the approach in identifying compounds of various toxic mechanisms.
引用
收藏
页码:655 / 664
页数:10
相关论文
共 35 条
[1]  
Allwein E., 2002, JMLR, V1, P113
[2]   A novel logic-based approach for quantitative toxicology prediction [J].
Amini, Ata ;
Muggleton, Stephen H. ;
Lodhi, Huma ;
Sternberg, Michael J. E. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (03) :998-1006
[3]  
[Anonymous], P 10 INT S FDN INT S
[4]  
[Anonymous], 2001, Journal of Machine Learning Research
[5]   Tandem 3D-QSARs approach as a valuable tool to predict binding affinity data: Design of new Gly/NMDA receptor antagonists as a key study [J].
Bacilieri, M. ;
Varano, F. ;
Deflorian, F. ;
Marini, M. ;
Catarzi, D. ;
Colotta, V. ;
Filacchioni, G. ;
Galli, A. ;
Costagli, C. ;
Kaseda, C. ;
Moro, S. .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2007, 47 (05) :1913-1922
[6]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[7]  
CHANG CC, 2001, LIBSVIM LIB SUPPORT
[8]   Assessment and modeling of the toxicity of organic chemicals to Chlorella vulgaris:: Development of a novel database [J].
Cronin, MTD ;
Netzeva, TI ;
Dearden, JC ;
Edwards, R ;
Worgan, ADP .
CHEMICAL RESEARCH IN TOXICOLOGY, 2004, 17 (04) :545-554
[9]   Multi-class protein fold recognition using support vector machines and neural networks [J].
Ding, CHQ ;
Dubchak, I .
BIOINFORMATICS, 2001, 17 (04) :349-358
[10]   REPRESENTATION OF MOLECULAR ELECTROSTATIC POTENTIALS BY TOPOLOGICAL FEATURE MAPS [J].
GASTEIGER, J ;
LI, XZ ;
RUDOLPH, C ;
SADOWSKI, J ;
ZUPAN, J .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1994, 116 (11) :4608-4620