Simultaneous feature selection and clustering of micro-array and RNA-sequence gene expression data using multiobjective optimization

被引:6
|
作者
Alok, Abhay Kumar [1 ]
Gupta, Pooja [2 ]
Saha, Sriparna [1 ]
Sharma, Vineet [2 ]
机构
[1] Indian Inst Technol, Comp Sci Engn, Patna, Bihar, India
[2] AKTU, Comp Sci Engn, Krishna Inst Engn & Technol, Lucknow, Uttar Pradesh, India
关键词
Gene expression data clustering; Feature selection; Point symmetry based distance; Multiobjective optimization; Cluster validity index; ALGORITHM; DISTANCE;
D O I
10.1007/s13042-020-01139-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we have devised a multiobjective optimization solution framework for solving the problem of gene expression data clustering in reduced feature space. Here clustering problem is viewed from two different aspects: clustering of genes in reduced sample space or clustering of samples in reduced gene space. Three objective functions: two internal cluster validity indices and the count on the number of features are optimized simultaneously by a popular multiobjective simulated annealing based approach, namely AMOSA. Here, point symmetry based distance is used for the assignment of gene data points to different clusters. Seven publicly available benchmark gene expression data sets are used for experimental purpose. Both aspects of clustering in reduced feature space is demonstrated. The proposed gene expression clustering technique outperforms the existing nine clustering techniques. Apart from this, also some statistical and biological significant tests have been carried out to show that the proposed FSC-MOO technique is more statistically and biologically enriched
引用
收藏
页码:2541 / 2563
页数:23
相关论文
共 50 条
  • [21] Simultaneous Clustering and Feature Selection Using Social Group Optimization With Dynamic Threshold Setting for Microarray Data
    Meesala, Y.V. Nagesh
    Parida, Ajaya Kumar
    Naik, Anima
    Informatica (Slovenia), 2024, 48 (23): : 199 - 218
  • [22] A Dissolving P System for Multi-objective Gene Combination Selection from Micro-array Data
    Liu, Fan
    Tuo, Shouheng
    Li, Chao
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 369 - 376
  • [23] Feature Selection and Clustering of Gene Expression Profiles Using Biological Knowledge
    Mitra, Sushmita
    Ghosh, Sampreeti
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1590 - 1599
  • [24] Particle Swarm Optimization with K-means for Simultaneous Feature Selection and Data Clustering
    Prakash, Jay
    Singh, Pramod Kumar
    2015 SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MACHINE INTELLIGENCE (ISCMI), 2015, : 74 - 78
  • [25] A Hybrid Feature Selection Method Using Gene Expression Data
    Chuang, Li-Yeh
    Wu, Kuo-Chuan
    Yang, Cheng-Hong
    2009 9TH IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING, 2009, : 100 - +
  • [26] Feature Selection for Alzheimer's Gene Expression Data Using Modified Binary Particle Swarm Optimization
    Ramaswamy, Ramya
    Kandhasamy, Premalatha
    Palaniswamy, Swathypriyadharsini
    IETE JOURNAL OF RESEARCH, 2023, 69 (01) : 9 - 20
  • [27] Binary Political Optimizer for Feature Selection Using Gene Expression Data
    Manita, Ghaith
    Korbaa, Ouajdi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2020, 2020
  • [28] Feature Selection and Classification for Gene Expression Data using Evolutionary Computation
    Banka, Haider
    Dara, Suresh
    2012 23RD INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS (DEXA), 2012, : 185 - 189
  • [29] Feature Selection Using Information Distance Measure for Gene Expression Data
    Cai, Jie
    Liang, Cheng
    Luo, Jiawei
    CURRENT PROTEOMICS, 2018, 15 (05) : 352 - 362
  • [30] Improved binary PSO for feature selection using gene expression data
    Chuang, Li-Yeh
    Chang, Hsueh-Wei
    Tu, Chung-Jui
    Yang, Cheng-Hong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (01) : 29 - 38