A model-based relevance estimation approach for feature selection in microarray datasets

被引:0
|
作者
Bontempi, Gianluca [1 ]
Meyer, Patrick E. [1 ]
机构
[1] Univ Libre Bruxelles, Dept Comp Sci, Machine Learning Grp, Brussels, Belgium
来源
ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT II | 2008年 / 5164卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The paper presents an original model-based approach for feature selection and its application to classification of microarray datasets. Model-based approaches to feature selection are generally denoted as wrappers. Wrapper methods assess subsets of variables according to their usefulness to a given prediction model which will be eventually used for classification. This strategy assumes that the accuracy of the model used for the wrapper selection is a good estimator of the relevance of the feature subset. We first discuss the limits of this assumption by showing that tire assessment of a subset by means of a generic learner (e.g. by cross-validation) returns a biased estimate of the relevance of the Subset itself. Secondly, we propose a low-bias estimator of the relevance based on the cross-validation assessment of an unbiased learner. Third, we assess a feature selection approach which combines the low-bias relevance estimator with state-of-the-art relevance estimators in order to enhance their accuracy. The experimental validation on 20 publicly available cancer expression datasets shows the robustness of a selection approach which is riot biased by a specific learner.
引用
收藏
页码:21 / 31
页数:11
相关论文
共 50 条
  • [1] Query Large Scale Microarray Compendium Datasets Using a Model-Based Bayesian Approach with Variable Selection
    Hu, Ming
    Qin, Zhaohui S.
    PLOS ONE, 2009, 4 (02):
  • [2] A GA-Based Approach to ICA Feature Selection: An Efficient Method to Classify Microarray Datasets
    Liu, Kun-Hong
    Zhang, Jun
    Li, Bo
    Du, Ji-Xiang
    ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 : 432 - +
  • [3] A hybrid feature selection approach for Microarray datasets using graph theoretic-based method
    Chamlal, Hasna
    Ouaderhman, Tayeb
    Rebbah, Fatima Ezzahra
    INFORMATION SCIENCES, 2022, 615 : 449 - 474
  • [4] Hybrid feature selection based on SLI and genetic algorithm for microarray datasets
    Sedighe Abasabadi
    Hossein Nematzadeh
    Homayun Motameni
    Ebrahim Akbari
    The Journal of Supercomputing, 2022, 78 : 19725 - 19753
  • [5] Hybrid feature selection based on SLI and genetic algorithm for microarray datasets
    Abasabadi, Sedighe
    Nematzadeh, Hossein
    Motameni, Homayun
    Akbari, Ebrahim
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (18): : 19725 - 19753
  • [6] A review of microarray datasets and applied feature selection methods
    Bolon-Canedo, V.
    Sanchez-Marono, N.
    Alonso-Betanzos, A.
    Benitez, J. M.
    Herrera, F.
    INFORMATION SCIENCES, 2014, 282 : 111 - 135
  • [7] Ant Colony Algorithm for Feature Selection on Microarray Datasets
    Fahrudin, Tresna Maulana
    Syarif, Iwan
    Barakbah, Ali Ridho
    2016 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2016, : 351 - 356
  • [8] A graph partitioning-based hybrid feature selection method in microarray datasets
    Oubaouzine, Abdelali
    Ouaderhman, Tayeb
    Chamlal, Hasna
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (01) : 633 - 660
  • [9] OVA scheme vs. single machine approach in feature selection for microarray datasets
    Ooi, Chia Huey
    Chetty, Madhu
    Teng, Shyh Wei
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 10 - 23
  • [10] Improved Swarm Intelligence Optimization Model with Mutual Information Estimation for Feature Selection in Microarray Bioinformatics Datasets for Diseases Diagnosis
    Krishna, Peddarapu Rama
    Rajarajeswari, Pothuraju
    INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2023, 23 (02) : 117 - 129