Monte Carlo Tree Search-Based Recursive Algorithm for Feature Selection in High-Dimensional Datasets

被引:1
|
作者
Chaudhry, Muhammad Umar [1 ,2 ]
Yasir, Muhammad [3 ]
Asghar, Muhammad Nabeel [4 ]
Lee, Jee-Hyong [2 ]
机构
[1] AiHawks, Multan 60000, Pakistan
[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Faisalabad Campus, Faisalabad 38000, Pakistan
[4] Bahauddin Zakariya Univ, Dept Comp Sci, Multan 60000, Pakistan
基金
新加坡国家研究基金会;
关键词
feature selection; dimensionality reduction; R-MOTiFS; Monte Carlo Tree Search (MCTS); heuristic feature selection; PARTICLE SWARM OPTIMIZATION; SUBSET-SELECTION; CLASSIFICATION; COLONY;
D O I
10.3390/e22101093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The complexity and high dimensionality are the inherent concerns of big data. The role of feature selection has gained prime importance to cope with the issue by reducing dimensionality of datasets. The compromise between the maximum classification accuracy and the minimum dimensions is as yet an unsolved puzzle. Recently, Monte Carlo Tree Search (MCTS)-based techniques have been invented that have attained great success in feature selection by constructing a binary feature selection tree and efficiently focusing on the most valuable features in the features space. However, one challenging problem associated with such approaches is a tradeoff between the tree search and the number of simulations. In a limited number of simulations, the tree might not meet the sufficient depth, thus inducing biasness towards randomness in feature subset selection. In this paper, a new algorithm for feature selection is proposed where multiple feature selection trees are built iteratively in a recursive fashion. The state space of every successor feature selection tree is less than its predecessor, thus increasing the impact of tree search in selecting best features, keeping the MCTS simulations fixed. In this study, experiments are performed on 16 benchmark datasets for validation purposes. We also compare the performance with state-of-the-art methods in literature both in terms of classification accuracy and the feature selection ratio.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [31] An iterative SVM approach to feature selection and classification in high-dimensional datasets
    Liu, Dehua
    Qian, Hui
    Dai, Guang
    Zhang, Zhihua
    PATTERN RECOGNITION, 2013, 46 (09) : 2531 - 2537
  • [32] Feature Selection for High-Dimensional Datasets through a Novel Artificial Bee Colony Framework
    Zhang, Yuanzi
    Wang, Jing
    Li, Xiaolin
    Huang, Shiguo
    Wang, Xiuli
    ALGORITHMS, 2021, 14 (11)
  • [33] A velocity-based butterfly optimization algorithm for high-dimensional optimization and feature selection
    Long, Wen
    Xu, Ming
    Jiao, Jianjun
    Wu, Tiebin
    Tang, Mingzhu
    Cai, Shaohong
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 201
  • [34] Spatial bound whale optimization algorithm: an efficient high-dimensional feature selection approach
    Too, Jingwei
    Mafarja, Majdi
    Mirjalili, Seyedali
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (23) : 16229 - 16250
  • [35] Improving Evolutionary Algorithm Performance for Feature Selection in High-Dimensional Data
    Cilia, N.
    De Stefano, C.
    Fontanella, F.
    di Freca, A. Scotto
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2018, 2018, 10784 : 439 - 454
  • [36] High-Dimensional Multimodal Feature Selection Based on Evolutionary Computation
    Ding, Zhuanlian
    Hu, Xi
    Cao, Liie
    Sun, Dengdi
    Zhang, Xingyi
    Wang, Chenxu
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2024, 58 (09): : 117 - 128
  • [37] A density-based clustering algorithm for high-dimensional data with feature selection
    Qi Xianting
    Wang Pan
    2016 2ND INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2016, : 114 - 118
  • [38] A Bi-Search Evolutionary Algorithm for High-Dimensional Bi-Objective Feature Selection
    Xu, Hang
    Xue, Bing
    Zhang, Mengjie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, 8 (05): : 3489 - 3502
  • [39] A dynamic stochastic search algorithm for high-dimensional optimization problems and its application to feature selection
    Liu, Qi
    Liu, Mengxue
    Wang, Fengde
    Xiao, Wensheng
    KNOWLEDGE-BASED SYSTEMS, 2022, 244
  • [40] Feature selection in high-dimensional classification via an adaptive multifactor evolutionary algorithm with local search
    Li, Zhihui
    Li, Hong
    Gao, Weifeng
    Xie, Jin
    Slowik, Adam
    APPLIED SOFT COMPUTING, 2025, 169