Monte Carlo Tree Search-Based Recursive Algorithm for Feature Selection in High-Dimensional Datasets

被引:1
|
作者
Chaudhry, Muhammad Umar [1 ,2 ]
Yasir, Muhammad [3 ]
Asghar, Muhammad Nabeel [4 ]
Lee, Jee-Hyong [2 ]
机构
[1] AiHawks, Multan 60000, Pakistan
[2] Sungkyunkwan Univ, Dept Elect & Comp Engn, Suwon 16419, South Korea
[3] Univ Engn & Technol Lahore, Dept Comp Sci, Faisalabad Campus, Faisalabad 38000, Pakistan
[4] Bahauddin Zakariya Univ, Dept Comp Sci, Multan 60000, Pakistan
基金
新加坡国家研究基金会;
关键词
feature selection; dimensionality reduction; R-MOTiFS; Monte Carlo Tree Search (MCTS); heuristic feature selection; PARTICLE SWARM OPTIMIZATION; SUBSET-SELECTION; CLASSIFICATION; COLONY;
D O I
10.3390/e22101093
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
The complexity and high dimensionality are the inherent concerns of big data. The role of feature selection has gained prime importance to cope with the issue by reducing dimensionality of datasets. The compromise between the maximum classification accuracy and the minimum dimensions is as yet an unsolved puzzle. Recently, Monte Carlo Tree Search (MCTS)-based techniques have been invented that have attained great success in feature selection by constructing a binary feature selection tree and efficiently focusing on the most valuable features in the features space. However, one challenging problem associated with such approaches is a tradeoff between the tree search and the number of simulations. In a limited number of simulations, the tree might not meet the sufficient depth, thus inducing biasness towards randomness in feature subset selection. In this paper, a new algorithm for feature selection is proposed where multiple feature selection trees are built iteratively in a recursive fashion. The state space of every successor feature selection tree is less than its predecessor, thus increasing the impact of tree search in selecting best features, keeping the MCTS simulations fixed. In this study, experiments are performed on 16 benchmark datasets for validation purposes. We also compare the performance with state-of-the-art methods in literature both in terms of classification accuracy and the feature selection ratio.
引用
收藏
页码:1 / 15
页数:15
相关论文
共 50 条
  • [41] Analytical and Experimental Study of Filter Feature Selection Algorithms for High-dimensional Datasets
    Pino, Adrian
    Morell, Carlos
    PROCEEDINGS OF THE FOURTH INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY, KNOWLEDGE MANAGEMENT AND DECISION SUPPORT (EUREKA-2013), 2013, 51 : 339 - 349
  • [42] A filter feature selection for high-dimensional data
    Janane, Fatima Zahra
    Ouaderhman, Tayeb
    Chamlal, Hasna
    JOURNAL OF ALGORITHMS & COMPUTATIONAL TECHNOLOGY, 2023, 17
  • [43] Whale Optimisation Algorithm for high-dimensional small-instance feature selection
    Mafarja, Majdi
    Jaber, Iyad
    Ahmed, Sobhi
    Thaher, Thaer
    INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2021, 36 (02) : 80 - 96
  • [44] Prior knowledge evaluation and emphasis sampling-based evolutionary algorithm for high-dimensional medical data feature selection
    Wang, Zhilin
    Shao, Lizhi
    Heidari, Ali Asghar
    Wang, Mingjing
    Chen, Huiling
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 273
  • [45] FACO: A Novel Hybrid Feature Selection Algorithm for High-Dimensional Data Classification
    Popoola, Gideon
    Oyeniran, Kayode
    SOUTHEASTCON 2024, 2024, : 61 - 68
  • [46] Hybrid binary Coral Reefs Optimization algorithm with Simulated Annealing for Feature Selection in high-dimensional biomedical datasets
    Yan, Chaokun
    Ma, Jingjing
    Luo, Huimin
    Patel, Ashutosh
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2019, 184 : 102 - 111
  • [47] Bi-objective feature selection in high-dimensional datasets using improved binary chimp optimization algorithm
    Al-qudah, Nour Elhuda A.
    Abed-alguni, Bilal H.
    Barhoush, Malek
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (12) : 6107 - 6148
  • [48] A Hybrid Algorithm Based on Binary Chemical Reaction Optimization and Tabu Search for Feature Selection of High-Dimensional Biomedical Data
    Chaokun Yan
    Jingjing Ma
    Huimin Luo
    Jianxin Wang
    Tsinghua Science and Technology, 2018, 23 (06) : 733 - 743
  • [49] A Hybrid Algorithm Based on Binary Chemical Reaction Optimization and Tabu Search for Feature Selection of High-Dimensional Biomedical Data
    Yan, Chaokun
    Ma, Jingjing
    Luo, Huimin
    Wang, Jianxin
    TSINGHUA SCIENCE AND TECHNOLOGY, 2018, 23 (06) : 733 - 743
  • [50] Feature Subset Selection for High-Dimensional, Low Sampling Size Data Classification Using Ensemble Feature Selection With a Wrapper-Based Search
    Mandal, Ashis Kumar
    Nadim, MD.
    Saha, Hasi
    Sultana, Tangina
    Hossain, Md. Delowar
    Huh, Eui-Nam
    IEEE ACCESS, 2024, 12 : 62341 - 62357