Artificial Intelligence based wrapper for high dimensional feature selection

被引:6
|
作者
Jain, Rahi [1 ]
Xu, Wei [2 ]
机构
[1] Princess Margaret Canc Res Ctr, Biostat Dept, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
High dimensional data; Wrapper feature selection; Artificial intelligence; AIWrap; Machine learning; Interaction terms; REDUCTION; REGRESSION; LASSO; SMOKE;
D O I
10.1186/s12859-023-05502-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Feature selection is important in high dimensional data analysis. The wrapper approach is one of the ways to perform feature selection, but it is computationally intensive as it builds and evaluates models of multiple subsets of features. The existing wrapper algorithm primarily focuses on shortening the path to find an optimal feature set. However, it underutilizes the capability of feature subset models, which impacts feature selection and its predictive performance. Method and Results: This study proposes a novel Artificial Intelligence based Wrapper (AIWrap) algorithm that integrates Artificial Intelligence (AI) with the existing wrapper algorithm. The algorithm develops a Performance Prediction Model using AI which predicts the model performance of any feature set and allows the wrapper algorithm to evaluate the feature subset performance in a model without building the model. The algorithm can make the wrapper algorithm more relevant for high-dimensional data. We evaluate the performance of this algorithm using simulated studies and real research studies. AIWrap shows better or at par feature selection and model prediction performance than standard penalized feature selection algorithms and wrapper algorithms. Conclusion: AIWrap approach provides an alternative algorithm to the existing algorithms for feature selection. The current study focuses on AIWrap application in continuous cross-sectional data. However, it could be applied to other datasets like longitudinal, categorical and time-to-event biological data.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Artificial Intelligence based wrapper for high dimensional feature selection
    Rahi Jain
    Wei Xu
    BMC Bioinformatics, 24
  • [2] Feature Selection on High Dimensional Data using Wrapper Based Subset Selection
    Manikandan, G.
    Susi, E.
    Abirami, S.
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 320 - 325
  • [3] A filter-wrapper model for high-dimensional feature selection based on evolutionary computation
    Hu, Pei
    Zhu, Jiulong
    APPLIED INTELLIGENCE, 2025, 55 (07)
  • [4] Ranking-based Feature Selection with Wrapper PSO Search in High-Dimensional Data Classification
    Saw, Thinzar
    Oo, Win Mar
    IAENG International Journal of Computer Science, 2023, 50 (01)
  • [5] Genetic Algorithm Based Wrapper Feature Selection on Hybrid Prediction Model for Analysis of High Dimensional Data
    Anirudha, R. C.
    Kannan, Remya
    Patil, Nagamma
    2014 9TH INTERNATIONAL CONFERENCE ON INDUSTRIAL AND INFORMATION SYSTEMS (ICIIS), 2014, : 290 - 295
  • [6] Boosting the Convergence of a GA-based Wrapper for Feature Selection Problems on High-dimensional Data
    Carlos Gomez-Lopez, Juan
    Jose Escobar, Juan
    Francisco Diaz, Antonio
    Damas, Miguel
    Gil-Montoya, Francisco
    Gonzalez, Jesus
    PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 431 - 434
  • [7] A wrapper for feature selection based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 618 - +
  • [8] Ensemble based on GA wrapper feature selection
    Yu, Enzhe
    Cho, Sungzoon
    COMPUTERS & INDUSTRIAL ENGINEERING, 2006, 51 (01) : 111 - 116
  • [9] Feature Subset Selection for High-Dimensional, Low Sampling Size Data Classification Using Ensemble Feature Selection With a Wrapper-Based Search
    Mandal, Ashis Kumar
    Nadim, MD.
    Saha, Hasi
    Sultana, Tangina
    Hossain, Md. Delowar
    Huh, Eui-Nam
    IEEE ACCESS, 2024, 12 : 62341 - 62357
  • [10] Designing a feature selection method based on explainable artificial intelligence
    Zacharias, Jan
    von Zahn, Moritz
    Chen, Johannes
    Hinz, Oliver
    ELECTRONIC MARKETS, 2022, 32 (04) : 2159 - 2184