Distance based feature selection for clustering microarray data

Citations: 0
Authors
Dash, Manoranjan [1]
Gopalkrishnan, Vivekanand [1]
Affiliations
[1] Nanyang Technol Univ, Singapore, Singapore
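Source
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS | 2008 / Vol. 4947
Keywords
feature selection; clustering; distance function; microarray data
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract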
In microarray data analysis, clustering is a fundamental task for separating genes into biologically functional groups or for classifying tissues and phenotypes. With recent gene expression microarray technologies, the expression levels of thousands of genes (features) can be measured simultaneously in a single experiment. The large number of genes, many of them noisy, makes cluster analysis highly complex. This challenge has raised the demand for feature selection, an effective dimensionality reduction technique that removes noisy features. In this paper we propose a novel filter method for feature selection. The proposed method, called ClosestFS, is based on a distance measure. For each feature, the distance is evaluated by computing the feature's impact on the histogram of the whole data. Our experimental results show that the quality of clustering (evaluated by several widely used measures) obtained by the K-means algorithm with ClosestFS as a pre-processing step is significantly better than that of plain K-means.
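The abstract does not say how the histogram is built or how the per-feature distance is computed, so the following is only an illustrative sketch of the general idea and not the ClosestFS algorithm itself. It assumes the histogram is taken over normalized pairwise Euclidean distances between samples, and that a feature's "impact" is the L1 difference between the histogram with and without that feature. The function names (`pairwise_distances`, `distance_histogram`, `histogram_impact`), the bin count, and the toy data are all hypothetical.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean distances between all distinct pairs of rows of X."""
    diff = X[:, None, :] - X[None, :, :]
    d = np.sqrt((diff ** 2).sum(axis=2))
    upper = np.triu_indices(X.shape[0], k=1)  # each pair counted once
    return d[upper]


def distance_histogram(X, bins=10):
    """Normalized histogram of pairwise distances, rescaled to [0, 1]."""
    d = pairwise_distances(X)
    top = d.max()
    if top > 0:
        d = d / top
    counts, _ = np.histogram(d, bins=bins, range=(0.0, 1.0))
    return counts / counts.sum()


def histogram_impact(X, bins=10):
    """Assumed impact score: L1 change in the histogram when a feature is dropped."""
    base = distance_histogram(X, bins)
    scores = np.empty(X.shape[1])
    for j in range(X.shape[1]):
        reduced = np.delete(X, j, axis=1)  # data without feature j
        scores[j] = np.abs(base - distance_histogram(reduced, bins)).sum()
    return scores


# Toy data: 30 samples, 5 features (synthetic, not microarray data).
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 5))
scores = histogram_impact(X)
ranking = np.argsort(scores)  # features ordered by increasing impact
```

Whether low-impact or high-impact features should be kept is not stated in the abstract, so the sketch stops at the ranking; in a filter pipeline the selected columns of `X` would then be passed to K-means.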
Pages: 512-519
Page count: 8