MRMR-EHO-Based Feature Selection Algorithm for Regression Modelling

被引:4
|
作者
Sathishkumar, V. E. [1 ]
Cho, Yongyun [2 ]
机构
[1] Hanyang Univ, Dept Ind Engn, 222 Wangsimini Ro, Seoul 06763, South Korea
[2] Sunchon Natl Univ, Dept Informat & Commun Engn, Sunchon, South Korea
来源
TEHNICKI VJESNIK-TECHNICAL GAZETTE | 2023年 / 30卷 / 02期
基金
新加坡国家研究基金会;
关键词
data mining; elephant herding optimization; feature selection; machine learning; MRMR; GENETIC ALGORITHMS; HYBRID; OPTIMIZATION; SOLVE;
D O I
10.17559/TV-20221119040501
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the classical regression theory, a single function model is fit to a data set. In a complex and noisy domain, this process is too complex and/or not reliable. Piecewise regression models provide solutions to overcome these difficulties. The regression performance can be improved by proper feature selection. This paper proposes a feature selection technique for improving regression problems using the hybridization of filter and wrapper feature selection methods. It uses a hybrid framework of Elephant Herding Optimization (EHO) and minimum Redundancy and Maximum Relevance (mRMR). The mRMR-EHO is implemented to maximize the performance of individual regression algorithms and the results are provided in this research. In this paper, the effectiveness of CUBIST and mRMR-EHO feature selection using six fine grained data from small-sized data to big data is empirically demonstrated such as: a) Strawberry Plants Nutrient water supply, b) Steel Industry Energy Consumption, c) Seoul Bike Sharing Demand, d) Seoul Bike Trip duration, e) Appliances energy consumption dataset, f) Capital Bike share program data the results show a marginal increase in performance even to a very large scale. All 6 datasets were pre-processed well for building the models. The empirical results are based on the following algorithms: a) Generalized Linear Regression, b) K nearest neighbour, c) Random Forest, d) Support Vector Machine, e) Gradient Boosting Machine, f) CUBIST. Their performances are compared, and the best-performing model is selected. Ultimately, this paper puts forth that the mRMR-EHO-based feature selection with the rule-based CUBIST model for regression can be used as an effective tool for predictive data modelling in various domains.
引用
收藏
页码:574 / 583
页数:10
相关论文
共 50 条
  • [21] PRESERVING COMMUNITY FEATURE EXTRACTION AND MRMR FEATURE SELECTION FOR LINK CLASSIFICATION IN COMPLEX NETWORKS
    Wu, Jie-Hua
    Zhou, Bei
    Shen, Jing
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 1, 2018, : 215 - 221
  • [22] Stateful MapReduce Framework for mRMR Feature Selection Using Horizontal Partitioning
    Yelleti, Vivek
    Prasad, P. S. V. S. Sai
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 317 - 327
  • [23] mRMR-PSO: A Hybrid Feature Selection Technique with a Multiobjective Approach for Sign Language Recognition
    BansalnAff, Sandhya Rani
    Wadhawan, Savita
    Goel, Rajeev
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (08) : 10365 - 10380
  • [24] Late Acceptance Hill Climbing Based Social Ski Driver Algorithm for Feature Selection
    Chatterjee, Bitanu
    Bhattacharyya, Trinav
    Ghosh, Kushal Kanti
    Singh, Pawan Kumar
    Geem, Zong Woo
    Sarkar, Ram
    IEEE ACCESS, 2020, 8 (08): : 75393 - 75408
  • [25] A Binary Waterwheel Plant Optimization Algorithm for Feature Selection
    Alhussan, Amel Ali
    Abdelhamid, Abdelaziz A.
    El-Kenawy, El-Sayed M.
    Ibrahim, Abdelhameed
    Eid, Marwa Metwally
    Khafaga, Doaa Sami
    Ahmed, Ayman Em
    IEEE ACCESS, 2023, 11 : 94227 - 94251
  • [26] Deluge based Genetic Algorithm for feature selection
    Guha, Ritam
    Ghosh, Manosij
    Kapri, Souvik
    Shaw, Sushant
    Mutsuddi, Shyok
    Bhateja, Vikrant
    Sarkar, Ram
    EVOLUTIONARY INTELLIGENCE, 2021, 14 (02) : 357 - 367
  • [27] Gene selection algorithm by combining reliefF and mRMR
    Yi Zhang
    Chris Ding
    Tao Li
    BMC Genomics, 9
  • [28] Feature selection algorithm based on P systems
    Song, Hongping
    Huang, Yourui
    Song, Qi
    Han, Tao
    Xu, Shanyong
    NATURAL COMPUTING, 2023, 22 (01) : 149 - 159
  • [29] A fast and novel approach based on grouping and weighted mRMR for feature selection and classification of protein sequence data
    Kaur, Kiranpreet
    Patil, Nagamma
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2020, 23 (01) : 47 - 61
  • [30] A Feature Selection Algorithm Based on Heuristic Decomposition
    Cavique, Luis
    Mendes, Armando B.
    Martiniano, Hugo F. M. C.
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017), 2017, 10423 : 525 - 536