Supervised feature selection on gene expression microarray datasets using manifold learning

被引:6
|
作者
Zare, Masoumeh [1 ,2 ]
Azizizadeh, Najmeh [2 ]
Kazemipour, Ali [1 ,2 ,3 ]
机构
[1] Shahid Bahonar Univ Kerman, Res Inst Plant Prod Technol, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Fac Math & Comp, Dept Appl Math, Kerman, Iran
[3] Shahid Bahonar Univ Kerman, Dept Agron & Plant Breeding, Kerman, Iran
关键词
Supervised feature selection; Microarray dataset; Discriminative features; Redundant features; MULTIPLE COMPARISONS; CLASSIFICATION; TESTS;
D O I
10.1016/j.chemolab.2023.104828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent decades, the ultimate output from microarray assay, has produced enormous numbers of microarray datasets, regardless of the used technology. These datasets include complex and high dimensional samples and genes that the number of samples is much smaller than the number of genes (features). Due to the redundant dimensions in these datasets, processing them directly not only leads to poor performance but also increases computation time and memory usage. Feature selection reduces computational expense while improving or maintaining diagnosis accuracy. In this study, we propose a new supervised feature selection method based on a manifold learning approach. We focus in two different directions to address this issue. First, maximum relevancy criterion that achieves by integrating Supervised Laplacian Eigenmaps (S-LE) and a matrix, which can realize the process of feature selection. The applied criterion simultaneously opts the features that make same-class samples closer to each other and ignores the features that cause different-class samples be near. Second, minimum redundancy among selected features by applying the Pearson correlation coefficient. In the test phase, the proposed method is compared with ten state-of-the-art algorithms on seven microarray datasets. Reported results show that the proposed method has more promising performance than the other methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Stable feature selection based on probability estimation in gene expression datasets
    Ahmadi, Melika
    Mahmoodian, Hamid
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 248
  • [32] Semi-supervised multi-label feature selection with adaptive structure learning and manifold learning
    Lv, Sitao
    Shi, Shengfei
    Wang, Hongzhi
    Li, Feng
    KNOWLEDGE-BASED SYSTEMS, 2021, 214
  • [33] Microarray gene expression classification based on supervised learning and similarity measures
    Liu, Qingzhong
    Sung, Andrew H.
    Xu, Jianyun
    Liu, Jianzhong
    Chen, Zhongxue
    2006 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-6, PROCEEDINGS, 2006, : 5094 - +
  • [34] A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis
    Lazar, Cosmin
    Taminau, Jonatan
    Meganck, Stijn
    Steenhoff, David
    Coletta, Alain
    Molter, Colin
    de Schaetzen, Virginie
    Duque, Robin
    Bersini, Hugues
    Nowe, Ann
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (04) : 1106 - 1119
  • [35] Incremental forward feature selection with application to microarray gene expression data
    Lee, Yuh-Jye
    Chang, Chien-Chung
    Chao, Chia-Huang
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2008, 18 (05) : 827 - 840
  • [36] Minimum redundancy feature selection from microarray gene expression data
    Ding, C
    Peng, HC
    PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 523 - 528
  • [37] Feature selection and ranking of key genes for tumor classification: Using microarray gene expression data
    Mukkamala, Srinivas
    Liu, Qingzhong
    Veeraghattam, Rajeev
    Sung, Andrew H.
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2006, PROCEEDINGS, 2006, 4029 : 951 - 961
  • [38] Optimized feature selection method using particle swarm intelligence with ensemble learning for cancer classification based on microarray datasets
    Alrefai, Nashat
    Ibrahim, Othman
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (16): : 13513 - 13528
  • [39] Optimized feature selection method using particle swarm intelligence with ensemble learning for cancer classification based on microarray datasets
    Nashat Alrefai
    Othman Ibrahim
    Neural Computing and Applications, 2022, 34 : 13513 - 13528
  • [40] Effective Feature Selection for Supervised Learning Using Genetic Algorithm
    Glaris, T. Hilda
    Rajalaxmi, R. R.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 909 - 914