An Overview of Data Mining Algorithms in Drug Induced Toxicity Prediction

被引:13
作者
Omer, Ankur [1 ,2 ]
Singh, Poonam [1 ,2 ]
Yadav, N. K. [1 ]
Singh, R. K. [1 ,2 ]
机构
[1] CSIR Cent Drug Res Inst, Div Toxicol, Lucknow, Uttar Pradesh, India
[2] Acad Sci & Innovat Res AcSIR, New Delhi, India
关键词
Bioinformatics; computational prediction; data mining; in silico; machine learning; toxicity prediction; NEURAL-NETWORKS; IN-VITRO; CLASSIFICATION; MODELS; CONFIDENCE; REGRESSION; KNOWLEDGE; SELECTION; SIMULATOR; PROTEIN;
D O I
10.2174/1389557514666140219110244
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
The growth in chemical diversity has increased the need to adjudicate the toxicity of different chemical compounds raising the burden on the demand of animal testing. The toxicity evaluation requires time consuming and expensive undertaking, leading to the deprivation of the methods employed for screening chemicals pointing towards the need to develop more efficient toxicity assessment systems. Computational approaches have reduced the time as well as the cost for evaluating the toxicity and kinetic behavior of any chemical. The accessibility of a large amount of data and the intense need of turning this data into useful information have attracted the attention towards data mining. Machine Learning, one of the powerful data mining techniques has evolved as the most effective and potent tool for exploring new insights on combinatorial relationships among various experimental data generated. The article accounts on some sophisticated machine learning algorithms like Artificial Neural Networks (ANN), Support Vector Machine (SVM), k-mean clustering and Self Organizing Maps (SOM) with some of the available tools used for classification, sorting and toxicological evaluation of data, clarifying, how data mining and machine learning interact cooperatively to facilitate knowledge discovery. Addressing the association of some commonly used expert systems, we briefly outline some real world applications to consider the crucial role of data set partitioning.
引用
收藏
页码:345 / 354
页数:10
相关论文
共 95 条
  • [1] Abraham A., 2005, Artificial neural networks. Handbook of measuring system design
  • [2] Naive Bayes QSDR classification based on spiral-graph Shannon entropies for protein biomarkers in human colon cancer
    Aguiar-Pulido, Vanessa
    Munteanu, Cristian R.
    Seoane, Jose A.
    Fernandez-Blanco, Enrique
    Perez-Montoto, Lazaro G.
    Gonzalez-Diaz, Humberto
    Dorado, Julian
    [J]. MOLECULAR BIOSYSTEMS, 2012, 8 (06) : 1716 - 1722
  • [3] [Anonymous], 2002, NAEG SONNIA VERS 4 2
  • [4] The better predictive model:: High q2 for the training set or low root mean square error of prediction for the test set?
    Aptula, AO
    Jeliazkova, NG
    Schultz, TW
    Cronin, MTD
    [J]. QSAR & COMBINATORIAL SCIENCE, 2005, 24 (03): : 385 - 396
  • [5] The utility of structure-activity relationship (SAR) models for prediction and covariate selection in developmental toxicity: Comparative analysis of logistic regression and decision tree models
    Arena, VC
    Sussman, NB
    Mazumdar, S
    Yu, S
    Macina, OT
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2004, 15 (01) : 1 - 18
  • [6] Ayodele T.O., 2010, NEW ADV MACHINE LEAR, DOI DOI 10.5772/9385
  • [7] Balkenius C., 1994, LUCS, V30
  • [8] Belal M., 2007, ENG TECHNOLOGY, V4
  • [9] Nexa: A scalable neural simulator with integrated analysis
    Benjaminsson, Simon
    Lansner, Anders
    [J]. NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2012, 23 (04) : 254 - 271
  • [10] Bishop C., 2006, PATTERN RECOGN, DOI DOI 10.1117/1.2819119