Clustering of gene expression data: Performance and similarity analysis

被引:0
作者
Yin, Longde [1 ]
Huang, Chun-Hsi [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1 | 2006年
关键词
clustering algorithms; gene expression; microarray; cluster similarity analysis; performance study;
D O I
10.1109/IMSCCS.2006.43
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recent advances of the DNA Microarray technology allow monitoring gene expression profiles of thousands of genes simultaneously. However, the analysis and handling of such fast growing data is becoming the major bottleneck in the utilization of the technology. Clustering analysis is one of the most effective methods for analyzing such gene expression data. In this paper we first experimentally study three major clustering algorithms: Hierarchical Clustering, Self-Organizing Map (SOM), and Self Organizing Tree Algorithm (SOTA), using Yeast Saccharomyces cerevisiae gene expression data, and compare their performance. Then, we present a data mining tool, Cluster Diff, which allows the similarity analysis of clusters generated by different algorithms. A case study is conducted based on clusters generated by SOTA and SOM.
引用
收藏
页码:142 / +
页数:3
相关论文
共 50 条
  • [31] Evaluation and comparison of clustering algorithms in analyzing es cell gene expression data
    Chen, GX
    Jaradat, SA
    Banerjee, N
    Tanaka, TS
    Ko, MSH
    Zhang, MQ
    STATISTICA SINICA, 2002, 12 (01) : 241 - 262
  • [32] Data Mining in Pathway Analysis for Gene Expression
    AlAjlan, Amani
    Badr, Ghada
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2015, 2015, 9165 : 69 - 77
  • [33] Proximity Measures for Clustering Gene Expression Microarray Data: A Validation Methodology and a Comparative Analysis
    Jaskowiak, Pablo A.
    Campello, Ricardo J. G. B.
    Costa, Ivan G.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (04) : 845 - 857
  • [34] Gene expression data analysis
    Brazma, A
    Vilo, J
    MICROBES AND INFECTION, 2001, 3 (10) : 823 - 829
  • [35] Clustering Temporal Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    2007 2ND BIO-INSPIRED MODELS OF NETWORKS, INFORMATION AND COMPUTING SYSTEMS (BIONETICS), 2007, : 183 - +
  • [36] Context-specific Bayesian clustering for gene expression data
    Barash, Y
    Friedman, N
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2002, 9 (02) : 169 - 191
  • [37] PSO Based Feature Selection for Clustering Gene Expression Data
    Deepthi, P. S.
    Thampi, Sabu M.
    2015 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, INFORMATICS, COMMUNICATION AND ENERGY SYSTEMS (SPICES), 2015,
  • [38] Ensemble classification for gene expression data based on parallel clustering
    Meng, Jun
    Jiang, Dingling
    Zhang, Jing
    Luan, Yushi
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2018, 20 (03) : 213 - 229
  • [39] Consensus clustering of gene expression data and its application to gene function prediction
    Xiao, Guanghua
    Pan, Wei
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2007, 16 (03) : 733 - 751
  • [40] Application of Multi-SOM clustering approach to macrophage gene expression analysis
    Ghouila, Amel
    Ben Yahia, Sadok
    Malouche, Dhafer
    Jmel, Haifa
    Laouini, Dhafer
    Guerfali, Fatma Z.
    Abdelhak, Sonia
    INFECTION GENETICS AND EVOLUTION, 2009, 9 (03) : 328 - 336