Stacked Ensemble for Bioactive Molecule Prediction

被引:11
|
作者
Petinrin, Olutomilayo Olayemi [1 ,2 ]
Saeed, Faisal [1 ,3 ]
机构
[1] Univ Teknol Malaysia Johor Bahru, Informat Syst Dept, Fac Comp, Skudai 81310, Malaysia
[2] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[3] Taibah Univ, Coll Comp Sci & Engn, Medina 42353, Saudi Arabia
关键词
Bioactive molecule prediction; chemoinformatics; drug discovery; ensemble; stacked ensemble; VOTING-BASED CLASSIFICATION; RANDOM FOREST; OPTIMIZATION; CLASSIFIERS; EXTRACTION;
D O I
10.1109/ACCESS.2019.2945422
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioactive molecular compounds are essential for drug discovery. The biological activity of these compounds needs to be predicted as this is used to determine the drug-target ability. As ineffective drugs are discarded after production, leading to resource and time wastage, it is important to predict bioactive molecules with models having high predictive performance. This study utilizes the stacked ensemble which uses the prediction of multiple base classifiers as features, used to train a meta classifier which makes the final prediction. Using three datasets DS1, DS2, and DS3 gotten from MDL Drug Data Report (MDDR) database, the performance of stacked ensemble was compared to three other ensembles: adaboost, bagging, and vote ensemble, based on different evaluation criteria and also a statistical method, Kendall's W test. The accuracy of Stacked ensemble ranged from 96.7002%, 98.2260% and 94.9007% for the three datasets respectively, although Vote had the best accuracy using dataset DS2 which consist of structurally homogeneous bioactive molecules. Also, using Kendall's W test to rank the ensembles, Stacked ensemble was ranked best with datasets DS1 and DS3, with both having a mean average of 4.00 and an overall level of agreement, W, of 0.986 and 1.000 respectively. Using dataset DS2, it was ranked after Vote and Adaboost with mean average of 2.33 and an overall level of agreement, W of 0.857. Stacked ensemble is recommended for the prediction of heterogeneous bioactive molecules during drug discovery and can also be implemented in other research areas.
引用
收藏
页码:153952 / 153957
页数:6
相关论文
共 50 条
  • [1] Bioactive molecule prediction using majority voting-based ensemble method
    Petinrin, Olutomilayo Olayemi
    Saeed, Faisal
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 383 - 392
  • [2] Stacked Ensemble Regression Model for Prediction of Furan
    Faraji, Mohammad Amin
    Shooshtari, Alireza
    El-Hag, Ayman
    ENERGIES, 2023, 16 (22)
  • [3] Stacked regression ensemble for cancer class prediction
    Sehgal, MSB
    Gondal, I
    Dooley, L
    2005 3rd IEEE International Conference on Industrial Informatics (INDIN), 2005, : 831 - 835
  • [4] Mortality prediction in ICU Using a Stacked Ensemble Model
    Ren, Na
    Zhao, Xin
    Zhang, Xin
    Computational and Mathematical Methods in Medicine, 2022, 2022
  • [5] A stacked ensemble machine learning approach for the prediction of diabetes
    Oliullah, Khondokar
    Rasel, Mahedi Hasan
    Islam, Md. Manzurul
    Islam, Md. Reazul
    Wadud, Md. Anwar Hussen
    Whaiduzzaman, Md.
    JOURNAL OF DIABETES AND METABOLIC DISORDERS, 2024, 23 (01) : 603 - 617
  • [6] Brain Stroke Prediction Using Stacked Ensemble Model
    Gunasekaran, Hemalatha
    Gladys, Angelin
    Kanmani, Deepa
    Macedo, Rex
    Blessing, N. R. Wilfred
    JURNAL KEJURUTERAAN, 2024, 36 (04): : 1759 - 1768
  • [7] Heart disease prediction using stacked ensemble technique
    Vasudev, R. Aravind
    Anitha, B.
    Manikandan, G.
    Karthikeyan, B.
    Ravi, Logesh
    Subramaniyaswamy, V
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (06) : 8249 - 8257
  • [8] Heterogeneous stacked ensemble classifier for software defect prediction
    Goyal, Somya
    Bhatia, Pradeep Kumar
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (26) : 37033 - 37055
  • [9] Stacked Ensemble Model for Tropical Cyclone Path Prediction
    Sattar, Kalim
    Zahra, Syeda Zoupash
    Faheem, Muhammad
    Missen, Malik Muhammad Saad
    Bashir, Rab Nawaz
    Abbas, Muhammad Zahid
    IEEE ACCESS, 2023, 11 : 69512 - 69521
  • [10] Heterogeneous stacked ensemble classifier for software defect prediction
    Somya Goyal
    Pradeep Kumar Bhatia
    Multimedia Tools and Applications, 2022, 81 : 37033 - 37055