Clustering of short time-course gene expression data with dissimilar replicates

Cited by: 1
Authors
Cinar, Ozan [1]
Ilk, Ozlem [2]
Iyigun, Cem [3]
Affiliations
[1] Maastricht Univ, Dept Psychiat & Neuropsychol, Maastricht, Netherlands
[2] Middle East Tech Univ, Dept Stat, Ankara, Turkey
[3] Middle East Tech Univ, Dept Ind Engn, Ankara, Turkey
Keywords
Microarray gene expression; Short time-series; Replication; Distance; Clustering; Cluster validation; SERIES DATA; MICROARRAY EXPERIMENTS; FORECAST DENSITIES; DNA MICROARRAY; CELL-CYCLE; PROFILES; PATTERNS; MODEL; CLASSIFICATION; IDENTIFICATION
DOI
10.1007/s10479-017-2583-3
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Subject classification code
070105; 12; 1201; 1202; 120202
Abstract
Microarrays are used in genetics and medicine to examine large numbers of genes simultaneously through their expression levels under any condition such as a disease of interest. The information from these experiments can be enriched by following the expression levels through time and biological replicates. The purpose of this study is to propose an algorithm which clusters the genes with respect to the similarities between their behaviors through time. The algorithm is also aimed at highlighting the genes which show different behaviors between the replicates and separating the constant genes that keep their baseline expression levels throughout the study. Finally, we aim to feature cluster validation techniques to suggest a sensible number of clusters when it is not known a priori. The illustrations show that the proposed algorithm in this study offers a fast approach to clustering the genes with respect to their behavior similarities, and also separates the constant genes and the genes with dissimilar replicates without any need for pre-processing. Moreover, it is also successful at suggesting the correct number of clusters when that is not known.
Pages: 405-428
Number of pages: 24
Related papers
50 in total
  • [21] Classification of patients from time-course gene expression
    Zhang, Yuping
    Tibshirani, Robert
    Davis, Ronald
    BIOSTATISTICS, 2013, 14 (01) : 87 - 98
  • [22] A novel approach for the analysis of time-course gene expression data based on computing with words
    Rowhanimanesh, Alireza
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 120
  • [23] Clustering of time-course gene expression profiles using normal mixture models with autoregressive random effects
    Wang, Kui
    Ng, Shu Kay
    McLachlan, Geoffrey J.
    BMC BIOINFORMATICS, 2012, 13
  • [24] Bayesian Functional Mixed-effects Models with Grouped Smoothness for Analyzing Time-course Gene Expression Data
    Ye, Shangyuan
    Liang, Ye
    Zhang, Bo
    CURRENT BIOINFORMATICS, 2021, 16 (01) : 2 - 12
  • [25] Genexpi: a toolset for identifying regulons and validating gene regulatory networks using time-course expression data
    Modrak, Martin
    Vohradsky, Jiri
    BMC BIOINFORMATICS, 2018, 19
  • [26] Clustering of high throughput gene expression data
    Pirim, Harun
    Eksioglu, Burak
    Perkins, Andy D.
    Yuceer, Cetin
    COMPUTERS & OPERATIONS RESEARCH, 2012, 39 (12) : 3046 - 3061
  • [27] Clustering gene expression time course data using mixtures of multivariate t-distributions
    McNicholas, Paul D.
    Subedi, Sanjeena
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2012, 142 (05) : 1114 - 1127
  • [28] Constrained Fourier estimation of short-term time-series gene expression data reduces noise and improves clustering and gene regulatory network predictions
    Bar, Nadav
    Nikparvar, Bahareh
    Jayavelu, Naresh Doni
    Roessler, Fabienne Krystin
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [29] Inferring cluster-based networks from differently stimulated multiple time-course gene expression data
    Shiraishi, Yuichi
    Kimura, Shuhei
    Okada, Mariko
    BIOINFORMATICS, 2010, 26 (08) : 1073 - 1081
  • [30] Phase-wise Clustering of Time Series Gene Expression Data
    Goyal, Poonam
    Karwa, Rohan Sunil
    Goyal, Navneet
    John, Matthew
    TRUSTCOM 2011: 2011 INTERNATIONAL JOINT CONFERENCE OF IEEE TRUSTCOM-11/IEEE ICESS-11/FCST-11, 2011, : 1668 - 1674