Distance based feature selection for clustering microarray data

Cited by: 0
Authors
Dash, Manoranjan [1]
Gopalkrishnan, Vivekanand [1]
Affiliations
[1] Nanyang Technol Univ, Singapore, Singapore
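Source
DATABASE SYSTEMS FOR ADVANCED APPLICATIONS | 2008 / Vol. 4947
Keywords
feature selection; clustering; distance function; microarray data
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract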
In microarray data analysis, clustering is a fundamental task for separating genes into biologically functional groups or for classifying tissues and phenotypes. With recent gene expression microarray technologies, the expression levels of thousands of genes (features) can be measured simultaneously in a single experiment. The large number of genes, many of them noisy, makes cluster analysis highly complex. This challenge has raised the demand for feature selection, an effective dimensionality reduction technique that removes noisy features. In this paper we propose a novel filter method for feature selection. The method, called ClosestFS, is based on a distance measure: for each feature, the distance is evaluated by computing the feature's impact on the histogram of the whole data. Our experimental results show that the clustering quality (evaluated by several widely used measures) of the K-means algorithm with ClosestFS as a pre-processing step is significantly better than that of K-means alone.
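The abstract does not specify the distance measure or how the histogram is built, so the following is only an illustrative sketch of a histogram-impact filter and should not be read as the ClosestFS algorithm itself. It assumes Euclidean distance, a histogram of all pairwise sample distances, and an L1 difference between the histograms computed with and without a given feature. The function names `pairwise_distance_histogram` and `histogram_impact_scores` and the `bins` parameter are hypothetical.

```python
import math
from itertools import combinations


def pairwise_distance_histogram(rows, features, bins=10):
    """Normalized histogram of Euclidean distances between all pairs of rows,
    computed over the given feature indices only (an assumed choice)."""
    dists = [
        math.sqrt(sum((a[f] - b[f]) ** 2 for f in features))
        for a, b in combinations(rows, 2)
    ]
    top = max(dists) or 1.0  # avoid division by zero when all distances are 0
    counts = [0] * bins
    for d in dists:
        idx = min(int(d / top * bins), bins - 1)  # clamp the maximum into the last bin
        counts[idx] += 1
    return [c / len(dists) for c in counts]


def histogram_impact_scores(rows, bins=10):
    """Score each feature by the L1 change in the distance histogram
    when that feature is left out (hypothetical scoring rule)."""
    all_features = list(range(len(rows[0])))
    full = pairwise_distance_histogram(rows, all_features, bins)
    scores = []
    for f in all_features:
        rest = [g for g in all_features if g != f]
        reduced = pairwise_distance_histogram(rows, rest, bins)
        scores.append(sum(abs(p - q) for p, q in zip(full, reduced)))
    return scores


if __name__ == "__main__":
    # Toy data: 4 samples x 3 features; the last feature is constant.
    data = [[0, 0, 5], [0, 1, 5], [10, 0, 5], [10, 1, 5]]
    print(histogram_impact_scores(data))
```

In this sketch a constant feature leaves every pairwise distance unchanged and therefore gets a score of zero. Whether a high or a low impact marks a feature as worth keeping is a design decision the abstract leaves open. The scored features would then be filtered before running K-means, matching the pre-processing role described above.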
Pages: 512-519
Page count: 8