Clustering of short time-course gene expression data with dissimilar replicates

被引:1
|
作者
Cinar, Ozan [1 ]
Ilk, Ozlem [2 ]
Iyigun, Cem [3 ]
机构
[1] Maastricht Univ, Dept Psychiat & Neuropsychol, Maastricht, Netherlands
[2] Middle East Tech Univ, Dept Stat, Ankara, Turkey
[3] Middle East Tech Univ, Dept Ind Engn, Ankara, Turkey
关键词
Microarray gene expression; Short time-series; Replication; Distance; Clustering; Cluster validation; SERIES DATA; MICROARRAY EXPERIMENTS; FORECAST DENSITIES; DNA MICROARRAY; CELL-CYCLE; PROFILES; PATTERNS; MODEL; CLASSIFICATION; IDENTIFICATION;
D O I
10.1007/s10479-017-2583-3
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under any condition such as a disease of interest. The information from these experiments can be enriched by following the expression levels through time and biological replicates. The purpose of this study is to propose an algorithm which clusters the genes with respect to the similarities between their behaviors through time. The algorithm is also aimed at highlighting the genes which show different behaviors between the replicates and separating the constant genes that keep their baseline expression levels throughout the study. Finally, we aim to feature cluster validation techniques to suggest a sensible number of clusters when it is not known a priori. The illustrations show that the proposed algorithm in this study offers a fast approach to clustering the genes with respect to their behavior similarities, and also separates the constant genes and the genes with dissimilar replicates without any need for pre-processing. Moreover, it is also successful at suggesting the correct number of clusters when that is not known.
引用
收藏
页码:405 / 428
页数:24
相关论文
共 50 条
  • [11] Clustering longitudinal profiles using P-splines and mixed effects models applied to time-course gene expression data
    Coffey, N.
    Hinde, J.
    Holian, E.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 14 - 29
  • [12] Nonlinear-Model-Based Analysis Methods for Time-Course Gene Expression Data
    Tian, Li-Ping
    Liu, Li-Zhi
    Wu, Fang-Xiang
    SCIENTIFIC WORLD JOURNAL, 2014,
  • [13] Optimal classification for time-course gene expression data using functional data analysis
    Song, Joon Jin
    Deng, Weiguo
    Lee, Ho-Jin
    Kwon, Deukwoo
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2008, 32 (06) : 426 - 432
  • [14] An empirical Bayes approach for analysis of diverse periodic trends in time-course gene expression data
    Kocak, Mehmet
    George, E. Olusegun
    Pyne, Saumyadipta
    Pounds, Stanley
    BIOINFORMATICS, 2013, 29 (02) : 182 - 188
  • [15] Maximization of negative correlations in time-course gene expression data for enhancing understanding of molecular pathways
    Zeng, Tao
    Li, Jinyan
    NUCLEIC ACIDS RESEARCH, 2010, 38 (01) : e1 - e1
  • [16] Improved Inference of Gene Regulatory Networks through Integrated Bayesian Clustering and Dynamic Modeling of Time-Course Expression Data
    Godsey, Brian
    PLOS ONE, 2013, 8 (07):
  • [17] Clustering of Time-Course Microarray Data Using Pharmacokinetic Parameter
    Lee, Hyo-Jung
    Kim, Peol-A
    Park, Mira
    KOREAN JOURNAL OF APPLIED STATISTICS, 2011, 24 (04) : 623 - 631
  • [18] Nonparametric Bayesian functional clustering for time-course microarray data
    Wei, Ziwen
    Kuo, Lynn
    STATISTICS AND ITS INTERFACE, 2014, 7 (04) : 543 - 557
  • [19] Comparative analysis of clustering methods for gene expression time course data
    Costa, IG
    de Carvalho, FDT
    de Souto, MCP
    GENETICS AND MOLECULAR BIOLOGY, 2004, 27 (04) : 623 - 631
  • [20] Significance analysis of time-course gene expression profiles
    Wu, Fang-Xiang
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PROCEEDINGS, 2007, 4463 : 13 - 24