Advancing Gene Expression Data Analysis: an Innovative Multi-objective Optimization Algorithm for Simultaneous Feature Selection and Clustering

被引:2
作者
Gupta, Pooja [1 ]
Alok, Abhay Kumar [2 ]
Sharma, Vineet [3 ]
机构
[1] Dr APJ Abdul Kalam Techn Univ, Lucknow, Uttar Pradesh, India
[2] Indian Inst Technol, Patna, India
[3] KIET Grp Inst, Ghaziabad, Delhi, India
关键词
Gene expression data Clustering; Feature selection; Point symmetry based distance; AMOSA; Cluster validity index; Feature weight index; ENSEMBLE; MODEL;
D O I
10.1590/1678-4324-2024230508
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Clustering algorithms play a crucial role in identifying co -expressed genes in microarray data, while feature subset identification is equally important when dealing with large data matrices. In this research paper, we address the problem of simultaneous feature selection and gene expression data clustering within a multiobjective optimization framework. Our approach employs the Archived multi -objective simulated annealing (AMOSA) algorithm to optimize a multi -objective function that incorporates two internal validity indices and a feature weight index. To determine data point membership in different clusters, we utilize a point symmetrybased distance metric. We demonstrate the effectiveness of our proposed approach on three publicly available gene expression datasets using the Silhouette index. Furthermore, we compare the clustering results of our approach, unsupervised feature selection and clustering using Multi -objective optimization framework (UFSC-MOO), to nine other existing techniques, showing its superior performance. Statistical significance is confirmed through Wilcoxon Rank Sum test. Also, biological significance test is employed to show that the obtained clustering solutions are biologically enriched.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] A multi-objective optimization algorithm for gene selection and classification in cancer study
    Banjoko, Alabi W.
    Yahya, Waheed B.
    Olaniran, Oyebayo R.
    APPLIED SOFT COMPUTING, 2025, 172
  • [22] A multi-objective genetic algorithm for simultaneous model and feature selection for support vector machines
    Amal Bouraoui
    Salma Jamoussi
    Yassine BenAyed
    Artificial Intelligence Review, 2018, 50 : 261 - 281
  • [23] LASSO multi-objective learning algorithm for feature selection
    Frederico Coelho
    Marcelo Costa
    Michel Verleysen
    Antônio P. Braga
    Soft Computing, 2020, 24 : 13209 - 13217
  • [24] A multi-objective genetic algorithm for simultaneous model and feature selection for support vector machines
    Bouraoui, Amal
    Jamoussi, Salma
    BenAyed, Yassine
    ARTIFICIAL INTELLIGENCE REVIEW, 2018, 50 (02) : 261 - 281
  • [25] A multi-objective immune algorithm for intrusion feature selection
    Wei, Wenhong
    Chen, Shuo
    Lin, Qiuzhen
    Ji, Junkai
    Chen, Jianyong
    APPLIED SOFT COMPUTING, 2020, 95
  • [26] LASSO multi-objective learning algorithm for feature selection
    Coelho, Frederico
    Costa, Marcelo
    Verleysen, Michel
    Braga, Antonio P.
    SOFT COMPUTING, 2020, 24 (17) : 13209 - 13217
  • [27] Feature selection of unreliable data using an improved multi-objective PSO algorithm
    Zhang Yong
    Gong Dun-wei
    Zhang Wan-qiu
    NEUROCOMPUTING, 2016, 171 : 1281 - 1290
  • [28] A hybrid multi-objective genetic algorithm for gene selection in microarray data
    Su, Yizhou
    Zhao, Guohua
    Lin, Yusong
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 443 - 449
  • [29] Simultaneous Feature and Sample Selection Using Ensemble Multi-objective Search Space Enhanced Modified Whale Optimization Algorithm
    Sathya, M.
    Priya, S. Manju
    INTERNATIONAL TRANSACTION JOURNAL OF ENGINEERING MANAGEMENT & APPLIED SCIENCES & TECHNOLOGIES, 2021, 12 (06):
  • [30] Multi-Objective Feature Selection With Missing Data in Classification
    Xue, Yu
    Tang, Yihang
    Xu, Xin
    Liang, Jiayu
    Neri, Ferrante
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (02): : 355 - 364