NPred: QSAR classification model for identifying plant based naturally occurring anti-cancerous inhibitors

Cited by: 22
Authors
Dhiman, Kanika [1]
Agarwal, Subhash Mohan [1]
Affiliations
[1] Inst Cytol & Prevent Oncol, Bioinformat Div, 1-7,Sect 39, Noida 201301, India
Keywords
ENALOS INSILICONANO PLATFORM; DRUG DISCOVERY; MOLECULAR DOCKING; DERIVATIVES; PRODUCTS; NANOPARTICLES; 3D-QSAR; SERVER; LEADS; TOOL;
DOI
10.1039/c6ra02772e
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
The prediction of naturally occurring plant based compounds as anticancer agents is the key to developing new chemical entities in the area of therapeutic oncology. Therefore, in the present study various machine learning techniques, viz. Naive Bayesian classifier (NB), sequential minimal optimization (SMO), instance based learner (IBK) and random forest (RF), have been used to develop models of the relationship between the chemical structures of plant based natural compounds and their anti-cancerous inhibition activity. These models were trained, tested and validated using 549 active and 424 inactive compounds deposited in the NPACT database. We observe that the random forest based model using 881 PubChem fingerprints showed the best performance, with an MCC of 0.54 and an accuracy of 77.6% on a five-fold cross-validation set and an MCC of 0.35 with an accuracy of 68.4% on an independent external validation set. Also, a frequency-based feature selection method was used to identify the fingerprints whose occurrence percentages differ between the active inhibitor dataset and the inactive set. We find that almost all of the top 10 fingerprints (FP797, FP818, FP12, FP179, FP3, FP143, FP712, FP704, FP334 and FP711) are present in vincristine, vinblastine and paclitaxel, three therapeutic drugs that are derived from natural products and used as anticancer drugs in clinics. Finally, we have also developed a web server, NPred, to predict the potential of natural compounds as anticancer agents and thus help researchers working in this area. We expect that the results of this study will pave the way for identifying and designing novel natural products as cancer growth inhibitors.
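The frequency-based feature selection and the MCC metric named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each compound is a binary fingerprint vector and that a bit is scored by the absolute difference between its occurrence percentage in the active set and in the inactive set. The function names and the toy data are hypothetical.

```python
import math


def occurrence_pct(fingerprints, bit):
    """Percentage of compounds in which the given fingerprint bit is set."""
    return 100.0 * sum(fp[bit] for fp in fingerprints) / len(fingerprints)


def rank_bits_by_frequency_difference(actives, inactives, top_n):
    """Return the top_n bit indices ranked by |% in actives - % in inactives|.

    Assumption: the abstract does not give the exact scoring rule, so the
    absolute percentage difference is used here for illustration only.
    """
    n_bits = len(actives[0])
    scored = []
    for bit in range(n_bits):
        diff = occurrence_pct(actives, bit) - occurrence_pct(inactives, bit)
        scored.append((abs(diff), bit))
    # Largest difference first; ties broken by the lower bit index.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [bit for _, bit in scored[:top_n]]


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0  # common convention when any marginal total is zero
    return (tp * tn - fp * fn) / denom


# Toy 4-bit fingerprints (hypothetical, not taken from NPACT).
toy_actives = [[1, 0, 1, 1], [1, 0, 0, 1], [1, 1, 1, 1], [1, 0, 1, 0]]
toy_inactives = [[0, 0, 1, 1], [0, 1, 1, 0], [0, 0, 1, 0], [0, 1, 1, 0]]

top_bits = rank_bits_by_frequency_difference(toy_actives, toy_inactives, 2)
print(top_bits)  # → [0, 3]
```

In the toy data, bit 0 occurs in 100% of actives and 0% of inactives, so it ranks first. Bit 3 (75% versus 25%) ranks second.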
Pages: 49395-49400
Number of pages: 6