Clustering of gene expression data: Performance and similarity analysis

Cited by: 0
Authors
Yin, Longde [1]
Huang, Chun-Hsi [1]
Affiliations
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
Source
FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 1 | 2006
Keywords
clustering algorithms; gene expression; microarray; cluster similarity analysis; performance study
DOI
10.1109/IMSCCS.2006.43
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Recent advances in DNA microarray technology allow the expression profiles of thousands of genes to be monitored simultaneously. However, analyzing and handling such fast-growing data has become the major bottleneck in using the technology. Cluster analysis is one of the most effective methods for analyzing gene expression data. In this paper, we first experimentally study three major clustering algorithms, Hierarchical Clustering, Self-Organizing Map (SOM), and Self-Organizing Tree Algorithm (SOTA), on yeast (Saccharomyces cerevisiae) gene expression data and compare their performance. We then present a data mining tool, Cluster Diff, which supports similarity analysis of clusters generated by different algorithms. A case study is conducted on clusters generated by SOTA and SOM.
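The abstract does not say how Cluster Diff measures similarity between clusters. As an illustration only, and not the authors' method, the sketch below compares two clusterings of the same genes by Jaccard overlap and reports, for each cluster from one algorithm, the closest cluster from the other. The function names, the gene identifiers (g1 to g6), and the cluster labels are all hypothetical.

```python
from typing import Dict, Hashable, List, Set, Tuple


def group_by_label(labels: Dict[str, Hashable]) -> Dict[Hashable, Set[str]]:
    """Turn a gene -> cluster-label mapping into cluster-label -> set of genes."""
    clusters: Dict[Hashable, Set[str]] = {}
    for gene, label in labels.items():
        clusters.setdefault(label, set()).add(gene)
    return clusters


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard index: size of the intersection divided by size of the union."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def best_matches(
    labels_a: Dict[str, Hashable], labels_b: Dict[str, Hashable]
) -> List[Tuple[Hashable, Hashable, float]]:
    """For each cluster in clustering A, find the most similar cluster in B.

    Assumes clustering B contains at least one cluster.
    """
    clusters_a = group_by_label(labels_a)
    clusters_b = group_by_label(labels_b)
    result = []
    # Sort by the string form of the label so the output order is stable.
    for label_a, genes_a in sorted(clusters_a.items(), key=lambda kv: str(kv[0])):
        label_b, score = max(
            ((lb, jaccard(genes_a, gb)) for lb, gb in clusters_b.items()),
            key=lambda pair: pair[1],
        )
        result.append((label_a, label_b, score))
    return result


if __name__ == "__main__":
    # Hypothetical assignments standing in for the output of two algorithms.
    clustering_a = {"g1": 0, "g2": 0, "g3": 0, "g4": 1, "g5": 1, "g6": 1}
    clustering_b = {"g1": "a", "g2": "a", "g3": "b", "g4": "b", "g5": "b", "g6": "b"}
    for label_a, label_b, score in best_matches(clustering_a, clustering_b):
        print(f"cluster {label_a} best matches cluster {label_b} (Jaccard {score:.2f})")
```

Jaccard overlap is used here because it needs no assumption that the two algorithms produce the same number of clusters; other agreement measures would serve equally well for this kind of comparison.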
Pages: 142 / +
Page count: 3
Related papers
50 items in total
  • [21] The "Gene Cube": A Novel Approach to Three-dimensional Clustering of Gene Expression Data
    Lambrou, George I.
    Sdraka, Maria
    Koutsouris, Dimitrios
    CURRENT BIOINFORMATICS, 2019, 14 (08) : 721 - 727
  • [22] Feature selection and gene clustering from gene expression data
    Mitra, P
    Majumder, DD
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, 2004 : 343 - 346
  • [23] Reducing the Subjectivity of Gene Expression Data Clustering Based on Spatial Contiguity Analysis
    Yi, Hui
    Song, Xiaofeng
    Jiang, Bin
    Liu, Yufang
    DATABASE THEORY AND APPLICATION, BIO-SCIENCE AND BIO-TECHNOLOGY, 2011, 258 : 118 - 124
  • [24] Temporal and Multivariate Similarity Clustering of 5G Performance Data
    Mazgula, Jakub
    Krol, Dariusz
    Jablonski, Ireneusz
    IEEE ACCESS, 2024, 12 : 114137 - 114145
  • [25] DYNAMIC CORE BASED CLUSTERING OF GENE EXPRESSION DATA
    Bocicor, Maria-Iuliana
    Sirbu, Adela
    Czibula, Gabriela
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2014, 10 (03) : 1051 - 1069
  • [26] Statistical inference for simultaneous clustering of gene expression data
    Pollard, KS
    van der Laan, MJ
    MATHEMATICAL BIOSCIENCES, 2002, 176 (01) : 99 - 121
  • [27] Clustering of Association Rules on Microarray Gene Expression Data
    Alagukumar, S.
    Vanitha, C. Devi Arockia
    Lawrance, R.
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 85 - 97
  • [28] CIS: A nonparametric clustering algorithm for gene expression data
    Zhao, YH
    Yin, Y
    Wang, GR
    Mao, KM
    PROCEEDINGS OF THE 11TH JOINT INTERNATIONAL COMPUTER CONFERENCE, 2005 : 651 - 656
  • [29] An optimal hierarchical clustering algorithm for gene expression data
    Seal, S
    Komarina, S
    Aluru, S
    INFORMATION PROCESSING LETTERS, 2005, 93 (03) : 143 - 147
  • [30] A sequential clustering algorithm with applications to gene expression data
    Song, Jongwoo
    Nicolae, Dan L.
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2009, 38 (02) : 175 - 184