Supervised feature selection on gene expression microarray datasets using manifold learning

被引:6
|
作者
Zare, Masoumeh [1 ,2 ]
Azizizadeh, Najmeh [2 ]
Kazemipour, Ali [1 ,2 ,3 ]
机构
[1] Shahid Bahonar Univ Kerman, Res Inst Plant Prod Technol, Kerman, Iran
[2] Shahid Bahonar Univ Kerman, Fac Math & Comp, Dept Appl Math, Kerman, Iran
[3] Shahid Bahonar Univ Kerman, Dept Agron & Plant Breeding, Kerman, Iran
关键词
Supervised feature selection; Microarray dataset; Discriminative features; Redundant features; MULTIPLE COMPARISONS; CLASSIFICATION; TESTS;
D O I
10.1016/j.chemolab.2023.104828
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent decades, the ultimate output from microarray assay, has produced enormous numbers of microarray datasets, regardless of the used technology. These datasets include complex and high dimensional samples and genes that the number of samples is much smaller than the number of genes (features). Due to the redundant dimensions in these datasets, processing them directly not only leads to poor performance but also increases computation time and memory usage. Feature selection reduces computational expense while improving or maintaining diagnosis accuracy. In this study, we propose a new supervised feature selection method based on a manifold learning approach. We focus in two different directions to address this issue. First, maximum relevancy criterion that achieves by integrating Supervised Laplacian Eigenmaps (S-LE) and a matrix, which can realize the process of feature selection. The applied criterion simultaneously opts the features that make same-class samples closer to each other and ignores the features that cause different-class samples be near. Second, minimum redundancy among selected features by applying the Pearson correlation coefficient. In the test phase, the proposed method is compared with ten state-of-the-art algorithms on seven microarray datasets. Reported results show that the proposed method has more promising performance than the other methods.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Feature Selection for Supervised Learning and Compression
    Taylor, Phillip
    Griffiths, Nathan
    Hall, Vince
    Xu, Zhou
    Mouzakitis, Alex
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [42] A Supervised Machine Learning Approach using Different Feature Selection Techniques on Voice Datasets for Prediction of Parkinson's Disease
    Aich, Satyabrata
    Kim, Hee-Cheol
    Younga, Kim
    Hui, Kueh Lee
    Al-Absi, Ahmed Abdulhakim
    Sain, Mangal
    2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 1116 - 1121
  • [43] A Hybrid Model for Optimum Gene Selection of Microarray Datasets
    Begum, Shemim
    Ansari, Ashraf Ali
    Sultan, Sadaf
    Dam, Rakhee
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 423 - 430
  • [44] Exploring the Stability of Feature Selection Methods across a Palette of Gene Expression Datasets
    Mungloo-Dilmohamud, Zahra
    Jaufeerally-Fakim, Yasmina
    Pena-Reyes, Carlos
    ICBBE 2019: 2019 6TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, 2019, : 7 - 12
  • [45] Adaptive Feature Selection and Image Classification Using Manifold Learning Techniques
    Ashraf, Amna
    Nawi, Nazri Mohd
    Aamir, Muhammad
    IEEE ACCESS, 2024, 12 : 40279 - 40289
  • [46] Optimized gene selection and classification of cancer from microarray gene expression data using deep learning
    Shah, Shamveel Hussain
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Khan, Suleman
    Rodrigues, Joel J. P. C.
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [47] Feature Selection for Microarray Gene Expression Data Using Simulated Annealing Guided by the Multivariate Joint Entropy
    Fernando Gonzalez-Navarro, Felix
    Belanche-Munoz, Lluis A.
    COMPUTACION Y SISTEMAS, 2014, 18 (02): : 275 - 293
  • [48] A self-supervised learning framework for classifying Microarray gene expression data
    Lu, Yijuan
    Tian, Qi
    Liu, Feng
    Sanchez, Maribel
    Wang, Yufeng
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 686 - 693
  • [49] A feature selection method using fixed-point algorithm for DNA microarray gene expression data
    Sharma, Alok
    Paliwal, Kuldip K.
    Imoto, Seiya
    Miyano, Satoru
    Sharma, Vandana
    Ananthanarayanan, Rajeshkannan
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2014, 18 (01) : 55 - 59
  • [50] Selection for feature gene subset in Microarray expression profiles based on a hybrid algorithm using SVM and GA
    Xiong, Wei
    Zhang, Chen
    Zhou, Chunguang
    Liang, Yanchun
    FRONTIERS OF HIGH PERFORMANCE COMPUTING AND NETWORKING - ISPA 2006 WORKSHOPS, PROCEEDINGS, 2006, 4331 : 637 - +