Optimizing Sequential Forward Selection on Classification Using Genetic Algorithm

被引:0
|
作者
Chotchantarakun K. [1 ]
机构
[1] Department of Information Studies, Faculty of Humanities and Social Science, Burapha University, 169 Longhaad Bangsaen Rd, Saensuk, Mueang, Chonburi
来源
Informatica (Slovenia) | 2023年 / 47卷 / 09期
关键词
classification accuracy; data mining; genetic algorithm; optimization; sequential feature selection;
D O I
10.31449/inf.v47i9.4964
中图分类号
学科分类号
摘要
Regarding the digital transformation of modern technologies, the amount of data increases significantly resulting in novel knowledge discovery techniques in Data Analytic and Data Mining. These data usually consist of noises or non-informative features that affect the analysis results. The features-eliminating approaches have been studied extensively in the past few decades name feature selection. It is a significant preprocessing step of the mining process, which selects only the informative features from the original feature set. These selected features improve the learning model efficiency. This study proposes a forward sequential feature selection method called Forward Selection with Genetic Algorithm (FS-GA). FS-GA consists of three major steps. First, it creates the preliminarily selected subsets. Second, it provides an improvement on the previous subsets. Third, it optimizes the selected subset using the genetic algorithm. Hence, it maximizes the classification accuracy during the feature addition. We performed experiments based on ten standard UCI datasets using three popular classification models including the Decision Tree, Naive Bayes, and K-Nearest Neighbour classifiers. The results are compared with the state-of-the-art methods. FS-GA has shown the best results against the other sequential forward selection methods for all the tested datasets with O(n2) time complexity. © 2023 Slovene Society Informatika. All rights reserved.
引用
收藏
页码:81 / 90
页数:9
相关论文
共 50 条
  • [21] Classification of mass and normal breast tissue: Feature selection using a genetic algorithm
    Sahiner, B
    Chan, HP
    Petrick, N
    Helvie, MA
    Goodsitt, MM
    Adler, DD
    DIGITAL MAMMOGRAPHY '96, 1996, 1119 : 379 - 384
  • [22] Feature Subset Selection Using Genetic Algorithm with Aggressive Mutation for Classification Problem
    Jermaine Pontiveros, Marc
    Solano, Geoffrey A.
    Diaz, Joey Mark S.
    Caro, Jaime D. L.
    2021 IEEE REGION 10 CONFERENCE (TENCON 2021), 2021, : 347 - 352
  • [23] Gene selection for classification of cancers using probabilistic model building genetic algorithm
    Paul, TK
    Iba, H
    BIOSYSTEMS, 2005, 82 (03) : 208 - 225
  • [24] Feature Selection Algorithm for Intrusions Detection System using Sequential Forward Search and Random Forest Classifier
    Lee, Jinlee
    Park, Dooho
    Lee, Changhoon
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2017, 11 (10): : 5112 - 5128
  • [25] Feature Selection Based on a Genetic Algorithm for Optimizing Weaning Success
    Rosati, Samanta
    Scotto, Andrea
    Fanelli, Vito
    Balestra, Gabriella
    CARING IS SHARING-EXPLOITING THE VALUE IN DATA FOR HEALTH AND INNOVATION-PROCEEDINGS OF MIE 2023, 2023, 302 : 566 - 570
  • [26] A Data Classification Method Using Genetic Algorithm and K-Means Algorithm with Optimizing Initial Cluster Center
    Shi, Haobin
    Xu, Meng
    2018 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION ENGINEERING TECHNOLOGY (CCET), 2018, : 224 - 228
  • [27] Optimizing Online Shopping using Genetic Algorithm
    Verma, Sahil
    Sinha, Akash
    Kumar, Prabhat
    Maitin, Ajay
    2020 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTER TECHNOLOGIES (ICICT 2020), 2020, : 271 - 275
  • [28] Optimizing service distributions using a genetic algorithm
    Jurasovic, Kresimir
    Kusek, Mario
    KNOWLEDGE - BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2008, 5177 : 158 - 165
  • [29] DAG-SVM based infant cry classification system using sequential forward floating feature selection
    Chuan-Yu Chang
    Chuan-Wang Chang
    S. Kathiravan
    Chen Lin
    Szu-Ta Chen
    Multidimensional Systems and Signal Processing, 2017, 28 : 961 - 976
  • [30] DAG-SVM based infant cry classification system using sequential forward floating feature selection
    Chang, Chuan-Yu
    Chang, Chuan-Wang
    Kathiravan, S.
    Lin, Chen
    Chen, Szu-Ta
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2017, 28 (03) : 961 - 976