Improving grasshopper optimization algorithm for hyperparameters estimation and feature selection in support vector regression

被引:44
作者
Algamal, Zakariya Yahya [1 ]
Qasim, Maimoonah Khalid [2 ]
Lee, Muhammad Hisyam [3 ]
Ali, Haithem Taha Mohammad [4 ]
机构
[1] Univ Mosul, Dept Stat & Informat, Mosul, Iraq
[2] Univ Mosul, Dept Gen Sci, Mosul, Iraq
[3] Univ Teknol Malaysia, Fac Sci, Dept Math Sci, Johor Baharu, Malaysia
[4] Nawroz Univ, Coll Comp & Informat Technol, Dahuk, Kurdistan Regio, Iraq
关键词
Grasshopper optimization algorithm; Evolutionary algorithm; SVR; Feature selection; HIGH-DIMENSIONAL QSAR; HYBRID GENETIC ALGORITHM; LOGISTIC-REGRESSION; CORROSION INHIBITION; VARIABLE SELECTION; FIREFLY ALGORITHM; MODEL; CLASSIFICATION; PARAMETERS; PREDICTION;
D O I
10.1016/j.chemolab.2020.104196
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
High-dimensionality is one of the major problems which affect the quality of the classification and prediction modeling. Support vector regression has been applied in several real problems. However, it is usually needed to tune manually the hyperparameters.In addition, SVR cannot perform feature selection. Nature-inspired algorithms have been used as a feature selection and as hyperparameters estimation procedure. In this paper, an improving grasshopper optimization algorithm (GOA) is proposed by adapting a new function of the main controlling parameter of GOA to enhance the exploration and exploitation capability of GOA. This improving is utilized to optimize the hyperparameters of the SVR with embedding the feature selection simultaneously. Experimental results, obtained by running on four datasets, show that our proposed algorithm performs better than crossvalidation method, in terms of prediction, number of selected features, and running time. Besides, the experimental results of the proposed improving confirm the efficiency of the proposed algorithm in improving the prediction performance and computational time compared to other nature-inspired algorithms, which proves the ability of GOA in searching for the best hyperparameters values and selecting the most informative features for prediction tasks.
引用
收藏
页数:7
相关论文
共 56 条
  • [1] A QSAR model for predicting antidiabetic activity of dipeptidyl peptidase-IV inhibitors by enhanced binary gravitational search algorithm
    Al-Fakih, A. M.
    Algamal, Z. Y.
    Lee, M. H.
    Aziz, M.
    Ali, H. T. M.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2019, 30 (06) : 403 - 416
  • [2] QSAR classification model for diverse series of antifungal agents based on improved binary differential search algorithm
    Al-Fakih, A. M.
    Algamal, Z. Y.
    Lee, M. H.
    Aziz, M.
    Ali, H. T. M.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2019, 30 (02) : 131 - 143
  • [3] Al-Fakih AM, 2015, INT J ELECTROCHEM SC, V10, P3568
  • [4] Quantitative structure-activity relationship model for prediction study of corrosion inhibition efficiency using two-stage sparse multiple linear regression
    Al-Fakih, Abdo Mohammed
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    Abdallah, Hassan H.
    Maarof, Hasmerya
    Aziz, Madzlan
    [J]. JOURNAL OF CHEMOMETRICS, 2016, 30 (07) : 361 - 368
  • [5] Tuning parameter estimation in SCAD-support vector machine using firefly algorithm with application in gene selection and cancer classification
    Al-Thanoon, Niam Abdulmunim
    Qasim, Omar Saber
    Algamal, Zakariya Yahya
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 103 : 262 - 268
  • [6] A QSAR classification model for neuraminidase inhibitors of influenza A viruses (H1N1) based on weighted penalized support vector machine
    Algamal, Z. Y.
    Qasim, M. K.
    Ali, H. T. M.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2017, 28 (05) : 415 - 426
  • [7] A new adaptive L1-norm for optimal descriptor selection of high-dimensional QSAR classification model for anti-hepatitis C virus activity of thiourea derivatives
    Algamal, Z. Y.
    Lee, M. H.
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2017, 28 (01) : 75 - 90
  • [8] A two-stage sparse logistic regression for optimal gene selection in high-dimensional microarray data classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    [J]. ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2019, 13 (03) : 753 - 771
  • [9] Algamal ZY, 2017, ELECTRON J APPL STAT, V10, P242, DOI 10.1285/i20705948v10n1p242
  • [10] Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 : 136 - 145