A New Alternating Suboptimal Dynamic Programming Algorithm with Applications for Feature Selection

被引:1
|
作者
Podgorelec, David [1 ]
Zalik, Borut [1 ]
Mongus, Domen [1 ]
Vlahek, Dino [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Koroska Cesta 46, SI-2000 Maribor, Slovenia
关键词
dynamic programming; suboptimal solution; feature selection; machine learning; MUTUAL INFORMATION; CLASSIFICATION; RELEVANCE; SEARCH;
D O I
10.3390/math12131987
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Feature selection is predominantly used in machine learning tasks, such as classification, regression, and clustering. It selects a subset of features (relevant attributes of data points) from a larger set that contributes as optimally as possible to the informativeness of the model. There are exponentially many subsets of a given set, and thus, the exhaustive search approach is only practical for problems with at most a few dozen features. In the past, there have been attempts to reduce the search space using dynamic programming. However, models that consider similarity in pairs of features alongside the quality of individual features do not provide the required optimal substructure. As a result, algorithms, which we will call suboptimal dynamic programming algorithms, find a solution that may deviate significantly from the optimal one. In this paper, we propose an iterative dynamic programming algorithm, which invertsthe order of feature processing in each iteration. Such an alternating approach allows for improving the optimization function by using the score from the previous iteration to estimate the contribution of unprocessed features. The iterative process is proven to converge and terminates when the solution does not change in three successive iterations or when the number of iterations reaches the threshold. Results in more than 95% of tests align with those of the exhaustive search approach, being competitive and often superior to the reference greedy approach. Validation was carried out by comparing the scores of output feature subsets and examining the accuracy of different classifiers learned on these features across nine real-world applications, considering different scenarios with various numbers of features and samples. In the context of feature selection, the proposed algorithm can be characterized as a robust filter method that can improve machine learning models regardless of dataset size. However, we expect that the idea of alternating suboptimal optimization will soon be generalized to tasks beyond feature selection.
引用
收藏
页数:22
相关论文
共 50 条
  • [31] SUBOPTIMAL ALGORITHM FOR IDENTIFICATION OF DYNAMIC OBJECTS
    KRASOVSKIY, AA
    ENGINEERING CYBERNETICS, 1977, 15 (05): : 109 - 114
  • [32] A New Hybrid Seagull Optimization Algorithm for Feature Selection
    Jia, Heming
    Xing, Zhikai
    Song, Wenlong
    IEEE ACCESS, 2019, 7 : 49614 - 49631
  • [33] NANO: A New Supervised Algorithm for Feature Selection with Discretization
    Senthilkumar, J.
    Manjula, D.
    Krishnamoorthy, R.
    2009 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE, VOLS 1-3, 2009, : 1515 - +
  • [34] A new transferred feature selection algorithm for customer identification
    Bing Zhu
    Yongge Niu
    Jin Xiao
    Bart Baesens
    Neural Computing and Applications, 2017, 28 : 2593 - 2603
  • [35] A New Feature Selection Algorithm for Stream Data Classification
    Wankhade, Kapil
    Rane, Dhiraj
    Thool, Ravindra
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1843 - 1848
  • [36] A new and fast rival genetic algorithm for feature selection
    Too, Jingwei
    Abdullah, Abdul Rahim
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (03): : 2844 - 2874
  • [37] Electricity price forecasting with a new feature selection algorithm
    Keynia, Farshid
    Amjady, Nima
    JOURNAL OF ENERGY MARKETS, 2008, 1 (04) : 47 - 63
  • [38] New feature selection algorithm based on potential difference
    Liu, Guangyuan
    Liu, Yu
    Dong, Liyan
    Yuan, Senmiao
    Li, Yongli
    2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 566 - +
  • [39] A new and fast rival genetic algorithm for feature selection
    Jingwei Too
    Abdul Rahim Abdullah
    The Journal of Supercomputing, 2021, 77 : 2844 - 2874
  • [40] A new transferred feature selection algorithm for customer identification
    Zhu, Bing
    Niu, Yongge
    Xiao, Jin
    Baesens, Bart
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (09): : 2593 - 2603