Selecting Optimal Feature Set in High-Dimensional Data by Swarm Search

被引:16
|
作者
Fong, Simon [1 ]
Zhuang, Yan [1 ]
Tang, Rui [1 ]
Yang, Xin-She [2 ]
Deb, Suash [3 ]
机构
[1] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
[2] Middlesex Univ, Fac Sci & Technol, London N17 8HR, England
[3] Cambridge Inst Technol, Dept Comp Sci & Engn, Ranchi, Bihar, India
关键词
D O I
10.1155/2013/590614
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Selecting the right set of features from data of high dimensionality for inducing an accurate classification model is a tough computational challenge. It is almost a NP-hard problem as the combinations of features escalate exponentially as the number of features increases. Unfortunately in data mining, as well as other engineering applications and bioinformatics, some data are described by a long array of features. Many feature subset selection algorithms have been proposed in the past, but not all of them are effective. Since it takes seemingly forever to use brute force in exhaustively trying every possible combination of features, stochastic optimization may be a solution. In this paper, we propose a new feature selection scheme called Swarm Search to find an optimal feature set by using metaheuristics. The advantage of Swarm Search is its flexibility in integrating any classifier into its fitness function and plugging in any metaheuristic algorithm to facilitate heuristic search. Simulation experiments are carried out by testing the Swarm Search over some high-dimensional datasets, with different classification algorithms and various metaheuristic algorithms. The comparative experiment results show that Swarm Search is able to attain relatively low error rates in classification without shrinking the size of the feature subset to its minimum.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] An Efficient Binary Sand Cat Swarm Optimization for Feature Selection in High-Dimensional Biomedical Data
    Pashaei, Elnaz
    BIOENGINEERING-BASEL, 2023, 10 (10):
  • [22] Surrogate Sample-Assisted Particle Swarm Optimization for Feature Selection on High-Dimensional Data
    Song, Xianfang
    Zhang, Yong
    Gong, Dunwei
    Liu, Hui
    Zhang, Wanqiu
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2023, 27 (03) : 595 - 609
  • [23] Search space division method for wrapper feature selection on high-dimensional data classification
    Chaudhuri, Abhilasha
    KNOWLEDGE-BASED SYSTEMS, 2024, 291
  • [24] Feature selection based on dynamic crow search algorithm for high-dimensional data classification
    Jiang, He
    Yang, Ye
    Wan, Qiuying
    Dong, Yao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 250
  • [25] Particle Swarm Optimisation for Feature Selection and Weighting in High-Dimensional Clustering
    O'Neill, Damien
    Lensen, Andrew
    Xue, Bing
    Zhang, Mengjie
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 173 - 180
  • [26] Feature selection for high-dimensional classification using a competitive swarm optimizer
    Gu, Shenkai
    Cheng, Ran
    Jin, Yaochu
    SOFT COMPUTING, 2018, 22 (03) : 811 - 822
  • [27] Feature selection for high-dimensional classification using a competitive swarm optimizer
    Shenkai Gu
    Ran Cheng
    Yaochu Jin
    Soft Computing, 2018, 22 : 811 - 822
  • [28] Optimal Sets of Projections of High-Dimensional Data
    Lehmann, Dirk J.
    Theisel, Holger
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2016, 22 (01) : 609 - 618
  • [29] Selecting an Optimal Feature Set for Stance Detection
    Vychegzhanin, Sergey
    Razova, Elena
    Kotelnikov, Evgeny
    Milov, Vladimir
    ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS, AIST 2019, 2019, 11832 : 242 - 253
  • [30] Feature selection for high-dimensional data using a multivariate search space reduction strategy based scatter search
    Garcia-Torres, Miguel
    JOURNAL OF HEURISTICS, 2025, 31 (01)