Clustering of short time-course gene expression data with dissimilar replicates

被引:1
|
作者
Cinar, Ozan [1 ]
Ilk, Ozlem [2 ]
Iyigun, Cem [3 ]
机构
[1] Maastricht Univ, Dept Psychiat & Neuropsychol, Maastricht, Netherlands
[2] Middle East Tech Univ, Dept Stat, Ankara, Turkey
[3] Middle East Tech Univ, Dept Ind Engn, Ankara, Turkey
关键词
Microarray gene expression; Short time-series; Replication; Distance; Clustering; Cluster validation; SERIES DATA; MICROARRAY EXPERIMENTS; FORECAST DENSITIES; DNA MICROARRAY; CELL-CYCLE; PROFILES; PATTERNS; MODEL; CLASSIFICATION; IDENTIFICATION;
D O I
10.1007/s10479-017-2583-3
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under any condition such as a disease of interest. The information from these experiments can be enriched by following the expression levels through time and biological replicates. The purpose of this study is to propose an algorithm which clusters the genes with respect to the similarities between their behaviors through time. The algorithm is also aimed at highlighting the genes which show different behaviors between the replicates and separating the constant genes that keep their baseline expression levels throughout the study. Finally, we aim to feature cluster validation techniques to suggest a sensible number of clusters when it is not known a priori. The illustrations show that the proposed algorithm in this study offers a fast approach to clustering the genes with respect to their behavior similarities, and also separates the constant genes and the genes with dissimilar replicates without any need for pre-processing. Moreover, it is also successful at suggesting the correct number of clusters when that is not known.
引用
收藏
页码:405 / 428
页数:24
相关论文
共 50 条
  • [31] A Model-based Approach to Transcription Regulatory Network Reconstruction from Time-Course Gene Expression Data
    Hu, Hong
    Dai, Yang
    2014 36TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2014, : 4767 - 4770
  • [32] Robust Bayesian Clustering for Replicated Gene Expression Data
    Sun, Jianyong
    Garibaldi, Jonathan M.
    Kenobi, Kim
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (05) : 1504 - 1514
  • [33] Analyzing Time-Course Microarray Data Using Functional Data Analysis - A Review
    Coffey, Norma
    Hinde, John
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
  • [34] ConstrainedMotif: A periodicity constraint based algorithm to predict cell-cycle associated promoter motifs using time-course gene expression data
    Liu, YR
    Murthy, KRK
    Sung, WK
    BIBE 2005: 5th IEEE Symposium on Bioinformatics and Bioengineering, 2005, : 250 - 257
  • [35] Proximity Measures for Clustering Gene Expression Microarray Data: A Validation Methodology and a Comparative Analysis
    Jaskowiak, Pablo A.
    Campello, Ricardo J. G. B.
    Costa, Ivan G.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2013, 10 (04) : 845 - 857
  • [36] DTW-MIC Coexpression Networks from Time-Course Data
    Riccadonna, Samantha
    Jurman, Giuseppe
    Visintainer, Roberto
    Filosi, Michele
    Furlanello, Cesare
    PLOS ONE, 2016, 11 (03):
  • [37] Reliable Detection of Short Periodic Gene Expression Time Series Profiles in DNA Microarray Data
    Liew, Alan Wee-Chung
    Yan, Hong
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 4274 - +
  • [38] Clustering Temporal Gene Expression Data with Unequal Time Intervals
    Rueda, Luis
    Bari, Ataul
    2007 2ND BIO-INSPIRED MODELS OF NETWORKS, INFORMATION AND COMPUTING SYSTEMS (BIONETICS), 2007, : 183 - +
  • [39] State-space approach with the maximum likelihood principle to identify the system generating time-course gene expression data of yeast
    Yamaguchi, Rui
    Higuchi, Tomoyuki
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2006, 1 (01) : 77 - 87
  • [40] Time-course data analysis of gene expression profiles reveals purR regulon concerns in organic solvent tolerance in Escherichia coli
    Shimizu, K
    Hayashi, S
    Doukyu, N
    Kobayashi, T
    Honda, H
    JOURNAL OF BIOSCIENCE AND BIOENGINEERING, 2005, 99 (01) : 72 - 74