A New Alternating Suboptimal Dynamic Programming Algorithm with Applications for Feature Selection

被引:1
|
作者
Podgorelec, David [1 ]
Zalik, Borut [1 ]
Mongus, Domen [1 ]
Vlahek, Dino [1 ]
机构
[1] Univ Maribor, Fac Elect Engn & Comp Sci, Koroska Cesta 46, SI-2000 Maribor, Slovenia
关键词
dynamic programming; suboptimal solution; feature selection; machine learning; MUTUAL INFORMATION; CLASSIFICATION; RELEVANCE; SEARCH;
D O I
10.3390/math12131987
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Feature selection is predominantly used in machine learning tasks, such as classification, regression, and clustering. It selects a subset of features (relevant attributes of data points) from a larger set that contributes as optimally as possible to the informativeness of the model. There are exponentially many subsets of a given set, and thus, the exhaustive search approach is only practical for problems with at most a few dozen features. In the past, there have been attempts to reduce the search space using dynamic programming. However, models that consider similarity in pairs of features alongside the quality of individual features do not provide the required optimal substructure. As a result, algorithms, which we will call suboptimal dynamic programming algorithms, find a solution that may deviate significantly from the optimal one. In this paper, we propose an iterative dynamic programming algorithm, which invertsthe order of feature processing in each iteration. Such an alternating approach allows for improving the optimization function by using the score from the previous iteration to estimate the contribution of unprocessed features. The iterative process is proven to converge and terminates when the solution does not change in three successive iterations or when the number of iterations reaches the threshold. Results in more than 95% of tests align with those of the exhaustive search approach, being competitive and often superior to the reference greedy approach. Validation was carried out by comparing the scores of output feature subsets and examining the accuracy of different classifiers learned on these features across nine real-world applications, considering different scenarios with various numbers of features and samples. In the context of feature selection, the proposed algorithm can be characterized as a robust filter method that can improve machine learning models regardless of dataset size. However, we expect that the idea of alternating suboptimal optimization will soon be generalized to tasks beyond feature selection.
引用
收藏
页数:22
相关论文
共 50 条
  • [21] New algorithm for dynamic programming on regular arrays
    Djamegni, Clementin Tayou
    Tchuente, Maurice
    Parallel Processing Letters, 2000, 10 (01): : 15 - 27
  • [22] Dynamic programming optimization algorithm applied in test case selection
    Banias, Ovidiu
    2018 13TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2018, : 106 - 109
  • [23] Quadratic Programming Feature Selection
    Rodriguez-Lujan, Irene
    Huerta, Ramon
    Elkan, Charles
    Santa Cruz, Carlos
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 1491 - 1516
  • [24] Quadratic programming feature selection
    Rodriguez-Lujan, Irene
    Huerta, Ramon
    Elkan, Charles
    Cruz, Carlos Santa
    Journal of Machine Learning Research, 2010, 11 : 1491 - 1516
  • [25] Classification of ECG beats by using a fast least square support vector machines with a dynamic programming feature selection algorithm
    Acir, N
    NEURAL COMPUTING & APPLICATIONS, 2005, 14 (04): : 299 - 309
  • [26] Classification of ECG beats by using a fast least square support vector machines with a dynamic programming feature selection algorithm
    Nurettin Acır
    Neural Computing & Applications, 2005, 14 : 299 - 309
  • [27] A new standard error based artificial bee colony algorithm and its applications in feature selection
    Hanbay, Kazim
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (07) : 4554 - 4567
  • [28] FEATURE SELECTION ALGORITHM BASED ON CONDITIONAL DYNAMIC MUTUAL INFORMATION
    Wang Liping
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2015, 8 (01): : 316 - 337
  • [29] Dynamic Feature Selection Based on Clustering Algorithm and Individual Similarity
    Dantas, Carine A.
    Nunes, Romulo O.
    Canuto, Anne M. P.
    Xavier-Junior, Joao C.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 467 - 474
  • [30] Applications of dynamic feature selection and clustering methods to medical diagnosis
    Ershadi, Mohammad Mahdi
    Seifi, Abbas
    APPLIED SOFT COMPUTING, 2022, 126