Monte Carlo Tree Search-Based Recursive Algorithm for Feature Selection in High-Dimensional Datasets

被引:1
|
作者
Chaudhry, Muhammad Umar [1 ,2 ]
Yasir, Muhammad [3 ]
Asghar, Muhammad Nabeel [4 ]
Lee, Jee-Hyong [2 ]
机构
[1] AiHawks, Multan 60000, Pakistan
[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Faisalabad Campus, Faisalabad 38000, Pakistan
[4] Bahauddin Zakariya Univ, Dept Comp Sci, Multan 60000, Pakistan
基金
新加坡国家研究基金会;
关键词
feature selection; dimensionality reduction; R-MOTiFS; Monte Carlo Tree Search (MCTS); heuristic feature selection; PARTICLE SWARM OPTIMIZATION; SUBSET-SELECTION; CLASSIFICATION; COLONY;
D O I
10.3390/e22101093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The complexity and high dimensionality are the inherent concerns of big data. The role of feature selection has gained prime importance to cope with the issue by reducing dimensionality of datasets. The compromise between the maximum classification accuracy and the minimum dimensions is as yet an unsolved puzzle. Recently, Monte Carlo Tree Search (MCTS)-based techniques have been invented that have attained great success in feature selection by constructing a binary feature selection tree and efficiently focusing on the most valuable features in the features space. However, one challenging problem associated with such approaches is a tradeoff between the tree search and the number of simulations. In a limited number of simulations, the tree might not meet the sufficient depth, thus inducing biasness towards randomness in feature subset selection. In this paper, a new algorithm for feature selection is proposed where multiple feature selection trees are built iteratively in a recursive fashion. The state space of every successor feature selection tree is less than its predecessor, thus increasing the impact of tree search in selecting best features, keeping the MCTS simulations fixed. In this study, experiments are performed on 16 benchmark datasets for validation purposes. We also compare the performance with state-of-the-art methods in literature both in terms of classification accuracy and the feature selection ratio.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [21] A GA-BASED FEATURE SELECTION AND ENSEMBLE LEARNING FOR HIGH-DIMENSIONAL DATASETS
    Xia, Pei-Yong
    Ding, Xiang-Qian
    Jiang, Bai-Ning
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 7 - +
  • [22] Particle swarm optimization algorithm based on comprehensive scoring framework for high-dimensional feature selection
    Wei, Bo
    Yang, Shanshan
    Zha, Wentao
    Deng, Li
    Huang, Jiangyi
    Su, Xiaohui
    Wang, Feng
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 95
  • [23] An evolutionary gravitational search-based feature selection
    Taradeh, Mohammad
    Mafarja, Majdi
    Heidari, Ali Asghar
    Faris, Hossam
    Aljarah, Ibrahim
    Mirjalili, Seyedali
    Fujita, Hamido
    INFORMATION SCIENCES, 2019, 497 : 219 - 239
  • [24] An Improved Gannet Optimization Algorithm Based on Opposition-Based Schemes for Feature Selection Problems in High-Dimensional Datasets
    Avinash N.
    Sinha S.K.
    Shivamurthaiah M.
    SN Computer Science, 5 (1)
  • [25] Binary Banyan tree growth optimization: A practical approach to high-dimensional feature selection
    Wu, Xian
    Fei, Minrui
    Zhou, Wenju
    Du, Songlin
    Fei, Zixiang
    Zhou, Huiyu
    KNOWLEDGE-BASED SYSTEMS, 2025, 315
  • [26] Copula entropy-based golden jackal optimization algorithm for high-dimensional feature selection problems
    Askr, Heba
    Abdel-Salam, Mahmoud
    Hassanien, Aboul Ella
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [27] High-dimensional feature selection via feature grouping: A Variable Neighborhood Search approach
    Garcia-Torres, Miguel
    Gomez-Vela, Francisco
    Melian-Batista, Belen
    Marcos Moreno-Vega, J.
    INFORMATION SCIENCES, 2016, 326 : 102 - 118
  • [28] A particle swarm optimization based multiobjective memetic algorithm for high-dimensional feature selection
    Juanjuan Luo
    Dongqing Zhou
    Lingling Jiang
    Huadong Ma
    Memetic Computing, 2022, 14 : 77 - 93
  • [29] A PSO Based Hybrid Feature Selection Algorithm for High-Dimensional Classification
    Binh Tran
    Zhang, Mengjie
    Xue, Bing
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 3801 - 3808
  • [30] A new search algorithm for feature selection in high-dimensional remote-sensing images
    Bruzzone, L
    Serpico, SB
    IMAGE AND SIGNAL PROCESSING FOR REMOTE SENSING IV, 1998, 3500 : 34 - 41