Classification of toxicity effects of biotransformed hepatic drugs using whale optimized support vector machines

被引:63
作者
Tharwat, Alaa [1 ,2 ]
Moemen, Yasmine S. [2 ,3 ]
Hassanien, Aboul Ella [2 ,4 ]
机构
[1] Suez Canal Univ, Fac Engn, Ismailia, Egypt
[2] Sci Res Grp Egypt SRGE, Cairo, Egypt
[3] Menoufia Univ, Natl Liver Inst, Dept Clin Pathol, Menoufia, Egypt
[4] Cairo Univ, Fac Comp & Informat, Giza, Egypt
关键词
Imbalanced dataset; Random sampling; Synthetic Minority Over-sampling; Technique (SMOTE); Support Vector Machines (SVM); Whale Optimization Algorithm (WOA); Toxic effects; FEATURE-SELECTION; IMBALANCED DATA; ROUGH SETS; SYSTEM; SVM; PERFORMANCE; PREDICTION; PARAMETERS; DISCOVERY; BEHAVIOR;
D O I
10.1016/j.jbi.2017.03.002
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Measuring toxicity is an important step in drug development. Nevertheless, the current experimental methods used to estimate the drug toxicity are expensive and time-consuming, indicating that they are not suitable for large-scale evaluation of drug toxicity in the early stage of drug development. Hence, there is a high demand to develop computational models that can predict the drug toxicity risks. In this study, we used a dataset that consists of 553 drugs that biotransformed in liver. The toxic effects were calculated for the current data, namely, mutagenic, tumorigenic, irritant and reproductive effect. Each drug is represented by 31 chemical descriptors (features). The proposed model consists of three phases. In the first phase, the most discriminative subset of features is selected using rough set-based methods to reduce the classification time while improving the classification performance. In the second phase, different sampling methods such as Random Under-Sampling, Random Over-Sampling and Synthetic Minority Oversampling Technique (SMOTE), BorderLine SMOTE and Safe Level SMOTE are used to solve the problem of imbalanced dataset. In the third phase, the Support Vector Machines (SVM) classifier is used to classify an unknown drug into toxic or non-toxic. SVM parameters such as the penalty parameter and kernel parameter have a great impact on the classification accuracy of the model. In this paper, Whale Optimization Algorithm (WOA) has been proposed to optimize the parameters of SVM, so that the classification error can be reduced. The experimental results proved that the proposed model achieved high sensitivity to all toxic effects. Overall, the high sensitivity of the WOA + SVM model indicates that it could be used for the prediction of drug toxicity in the early stage of drug development. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:132 / 149
页数:18
相关论文
共 72 条
  • [61] Ting KM, 2002, IEEE T KNOWL DATA EN, V14, P659, DOI 10.1109/TKDE.2002.1000348
  • [62] Toxicogenomics and drug discovery: will new technologies help us produce better drugs?
    Ulrich, R
    Friend, SH
    [J]. NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (01) : 84 - 88
  • [63] Molecular properties that influence the oral bioavailability of drug candidates
    Veber, DF
    Johnson, SR
    Cheng, HY
    Smith, BR
    Ward, KW
    Kopple, KD
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2002, 45 (12) : 2615 - 2623
  • [64] Toxicity-indicating structural patterns
    von Korff, M
    Sander, T
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (02) : 536 - 544
  • [65] Discernibility matrix based algorithm for reduction of attributes
    Wang, Ruizhi
    Miao, Duoqian
    Hu, Guirong
    [J]. 2006 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY, WORKSHOPS PROCEEDINGS, 2006, : 477 - +
  • [66] ADMET Evaluation in Drug Discovery. 12. Development of Binary Classification Models for Prediction of hERG Potassium Channel Blockage
    Wang, Sichao
    Li, Youyong
    Wang, Junmei
    Chen, Lei
    Zhang, Liling
    Yu, Huidong
    Hou, Tingjun
    [J]. MOLECULAR PHARMACEUTICS, 2012, 9 (04) : 996 - 1010
  • [67] Feature selection based on rough sets and particle swarm optimization
    Wang, Xiangyang
    Yang, Jie
    Teng, Xiaolong
    Xia, Weijun
    Jensen, Richard
    [J]. PATTERN RECOGNITION LETTERS, 2007, 28 (04) : 459 - 471
  • [68] AERIAL OBSERVATION OF FEEDING-BEHAVIOR IN 4 BALEEN WHALES - EUBALAENA-GLACIALIS, BALAENOPTERA-BOREALIS, MEGAPTERA-NOVAEANGLIAE, AND BALAENOPTERA-PHYSALUS
    WATKINS, WA
    SCHEVILL, WE
    [J]. JOURNAL OF MAMMALOGY, 1979, 60 (01) : 155 - 163
  • [69] DrugBank:: a comprehensive resource for in silico drug discovery and exploration
    Wishart, David S.
    Knox, Craig
    Guo, An Chi
    Shrivastava, Savita
    Hassanali, Murtaza
    Stothard, Paul
    Chang, Zhan
    Woolsey, Jennifer
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D668 - D672
  • [70] DEVELOPMENT OF STRUCTURE-ACTIVITY RELATIONSHIP RULES FOR PREDICTING CARCINOGENIC POTENTIAL OF CHEMICALS
    WOO, YT
    LAI, DY
    ARGUS, MF
    ARCOS, JC
    [J]. TOXICOLOGY LETTERS, 1995, 79 (1-3) : 219 - 228