MRMR-EHO-Based Feature Selection Algorithm for Regression Modelling

被引:4
|
作者
Sathishkumar, V. E. [1 ]
Cho, Yongyun [2 ]
机构
[1] Hanyang Univ, Dept Ind Engn, 222 Wangsimini Ro, Seoul 06763, South Korea
[2] Sunchon Natl Univ, Dept Informat & Commun Engn, Sunchon, South Korea
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2023年 / 30卷 / 02期
基金
新加坡国家研究基金会;
关键词
data mining; elephant herding optimization; feature selection; machine learning; MRMR; GENETIC ALGORITHMS; HYBRID; OPTIMIZATION; SOLVE;
D O I
10.17559/TV-20221119040501
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the classical regression theory, a single function model is fit to a data set. In a complex and noisy domain, this process is too complex and/or not reliable. Piecewise regression models provide solutions to overcome these difficulties. The regression performance can be improved by proper feature selection. This paper proposes a feature selection technique for improving regression problems using the hybridization of filter and wrapper feature selection methods. It uses a hybrid framework of Elephant Herding Optimization (EHO) and minimum Redundancy and Maximum Relevance (mRMR). The mRMR-EHO is implemented to maximize the performance of individual regression algorithms and the results are provided in this research. In this paper, the effectiveness of CUBIST and mRMR-EHO feature selection using six fine grained data from small-sized data to big data is empirically demonstrated such as: a) Strawberry Plants Nutrient water supply, b) Steel Industry Energy Consumption, c) Seoul Bike Sharing Demand, d) Seoul Bike Trip duration, e) Appliances energy consumption dataset, f) Capital Bike share program data the results show a marginal increase in performance even to a very large scale. All 6 datasets were pre-processed well for building the models. The empirical results are based on the following algorithms: a) Generalized Linear Regression, b) K nearest neighbour, c) Random Forest, d) Support Vector Machine, e) Gradient Boosting Machine, f) CUBIST. Their performances are compared, and the best-performing model is selected. Ultimately, this paper puts forth that the mRMR-EHO-based feature selection with the rule-based CUBIST model for regression can be used as an effective tool for predictive data modelling in various domains.
引用
收藏
页码:574 / 583
页数:10
相关论文
共 50 条
  • [31] A Novel Auction-Based Optimization Algorithm and Its Application in Rough Set Feature Selection
    Jaddi, Najmeh Sadat
    Abdullah, Salwani
    IEEE ACCESS, 2021, 9 : 106501 - 106514
  • [32] A local information-based feature-selection algorithm for data regression
    Peng, Xinjun
    Xu, Dong
    PATTERN RECOGNITION, 2013, 46 (09) : 2519 - 2530
  • [33] Genetic Algorithm-based Feature Selection for Depression Scale Prediction
    Lee, Seung-Ju
    Moon, Hyun-Ji
    Kim, Da-Jung
    Yoon, Yourim
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION (GECCCO'19 COMPANION), 2019, : 65 - 66
  • [34] A Novel Wrapper-Based Optimization Algorithm for the Feature Selection and Classification
    Talpur, Noureen
    Abdulkadir, Said Jadid
    Hasan, Mohd Hilmi
    Alhussian, Hitham
    Alwadain, Ayed
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (03): : 5799 - 5820
  • [35] Improved Salp Swarm Algorithm based on opposition based learning and novel local search algorithm for feature selection
    Tubishat, Mohammad
    Idris, Norisma
    Shuib, Liyana
    Abushariah, Mohammad A. M.
    Mirjalili, Seyedali
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 145
  • [36] MRMR-SSA: a hybrid approach for optimal feature selection
    Monalisha Mahapatra
    Santosh Kumar Majhi
    Sunil Kumar Dhal
    Evolutionary Intelligence, 2022, 15 : 2017 - 2036
  • [37] Minimizing Probing Cost With mRMR Feature Selection in Network Monitoring
    Fan, XiaoBo
    Li, Xingming
    IEEE COMMUNICATIONS LETTERS, 2017, 21 (11) : 2400 - 2403
  • [38] Filtering Feature Selection Algorithm based on Fusion Strategy
    Xiao, Yufei
    Liu, Tianhe
    2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 675 - 680
  • [39] Feature selection schema based on game theory and biology migration algorithm for regression problems
    Mohammad Masoud Javidi
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 303 - 342
  • [40] Feature selection schema based on game theory and biology migration algorithm for regression problems
    Javidi, Mohammad Masoud
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2021, 12 (02) : 303 - 342