A comparative study of support vector machine, artificial neural network and Bayesian classifier for mutagenicity prediction

被引:0
作者
Anju Sharma
Rajnish Kumar
Pritish Kumar Varadwaj
Ausaf Ahmad
Ghulam Md Ashraf
机构
[1] Indian Institute of Information Technology Allahabad Deoghat,Department of Bioinformatics
[2] Amity University Uttar Pradesh (AUUP),Amity Institute of Biotechnology (AIB)
来源
Interdisciplinary Sciences: Computational Life Sciences | 2011年 / 3卷
关键词
Artificial Neural Network; Bayesian classifier; mutagenicity; prediction; Support Vector Machine;
D O I
暂无
中图分类号
学科分类号
摘要
Mutagenicity is the capability of a chemical to carry out mutations in genetic material of an organism. In order to curtail expensive drug failures due to mutagenicity found in late development or even in clinical trials, it is crucial to determine potential mutagenicity problems as early as possible. In this work we have proposed three different classifiers, i.e. Support Vector Machine (SVM), Artificial Neural Network (ANN) and Bayesian classifiers, for the prediction of mutagenicity of compounds based on seventeen descriptors. Among the three classifiers Radial Basis Function (RBF) kernel based SVM classifier appeared to be more accurate for classifying the compounds under study on mutagens and non-mutagens. The overall prediction accuracy of SVM model was found to be 71.73% which was appreciably higher than the accuracy of ANN based classifier (59.72%) and Bayesian classifier (66.61%). It suggests that SVM based prediction model can be used for predicting mutagenicity more accurately compared to ANN and Bayesian classifier for data under consideration.
引用
收藏
页码:232 / 239
页数:7
相关论文
共 125 条
[1]  
Ashby J.(1991)Definitive relationships among chemical structure, carcinogenicity and mutagenicity for 301 chemicals tested by the U.S NTP. Mutat Res 257 229-306
[2]  
Tennant R.W.(2001)Drug design by machine learning: Support vector machines for pharmaceutical data analysis Comput Chem 26 5-14
[3]  
Burbidge R.(2001)The new pre-clinical paradigm: compound optimization in early and late phase drug discovery Curr Top Med Chem 1 353-366
[4]  
Trotter M.(2002)Comparison of the computer programs DEREK and TOPKAT to predict bacterial mutagenicity. Deductive estimate of risk from existing knowledge. Toxicity prediction by computer assisted technology Mutagenesis 17 321-329
[5]  
Buxton B.(1997)TOPKAT 5.0 and modulation of toxicity Mut Res 379 514-519
[6]  
Holden S.(2003)Predictive toxicology: Benchmarking molecular descriptors and statistical methods J Chem Inf Comput Sci 43 1463-1470
[7]  
Caldwell G.W.(1985)A model based on molecular structure descriptors for predicting mutagenicity of organic compounds Toxicol Environ Chem 10 157-170
[8]  
Ritchie D.M.(2002)Computer systems for the prediction of toxicity: An update Adv Drug Deliv Rev 54 417-431
[9]  
Masucci J.A.(2006)The blue obelisk — interoperability in chemical informatics J Chem Inf Model 46 991-998
[10]  
Hageman W.(2009)The WEKA data mining software: An update SIGKDD Explorations 11 10-18