Alignment and classification of time series gene expression in clinical studies

被引:49
作者
Lin, Tien-ho [1 ]
Kaminski, Naftali [2 ]
Bar-Joseph, Ziv [1 ]
机构
[1] Carnegie Mellon Univ, Sch Comp Sci, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Sch Med, Simmons Ctr Interstitial Lung Dis, Pittsburgh, PA 15213 USA
关键词
D O I
10.1093/bioinformatics/btn152
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Classification of tissues using static gene-expression data has received considerable attention. Recently, a growing number of expression datasets are measured as a time series. Methods that are specifically designed for this temporal data can both utilize its unique features (temporal evolution of profiles) and address its unique challenges (different response rates of patients in the same class). Results: We present a method that utilizes hidden Markov models (HMMs) for the classification task. We use HMMs with less states than time points leading to an alignment of the different patient response rates. To focus on the differences between the two classes we develop a discriminative HMM classifier. Unlike the traditional generative HMM, discriminative HMM can use examples from both classes when learning the model for a specific class. We have tested our method on both simulated and real time series expression data. As we show, our method improves upon prior methods and can suggest markers for specific disease and response stages that are not found when using traditional classifiers.
引用
收藏
页码:I147 / I155
页数:9
相关论文
共 25 条
[1]   Aligning gene expression time series with time warping algorithms [J].
Aach, J ;
Church, GM .
BIOINFORMATICS, 2001, 17 (06) :495-508
[2]  
ALIZADEH A, 2000, SCIENCE, V403, P503
[3]   Continuous representations of time-series gene expression data [J].
Bar-Joseph, Z ;
Gerber, GK ;
Gifford, DK ;
Jaakkola, TS ;
Simon, I .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (3-4) :341-356
[4]   Transcription-based prediction of response to IFNβ using supervised computational methods [J].
Baranzini, SE ;
Mousavi, P ;
Rio, J ;
Caillier, SJ ;
Stillman, A ;
Villoslada, P ;
Wyatt, MM ;
Comabella, M ;
Greller, LD ;
Somogyi, R ;
Montalban, X ;
Oksenberg, JR .
PLOS BIOLOGY, 2005, 3 (01) :166-176
[5]   PCA disjoint models for multiclass cancer analysis using gene expression data [J].
Bicciato, S ;
Luchini, A ;
Di Bello, C .
BIOINFORMATICS, 2003, 19 (05) :571-578
[6]  
Borgwardt Karsten M, 2006, Pac Symp Biocomput, P547, DOI 10.1142/9789812701626_0051
[7]   Reconstructing dynamic regulatory maps [J].
Ernst, Jason ;
Vainas, Oded ;
Harbison, Christopher T. ;
Simon, Itamar ;
Bar-Joseph, Ziv .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[8]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[9]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[10]   AN INEQUALITY FOR RATIONAL FUNCTIONS WITH APPLICATIONS TO SOME STATISTICAL ESTIMATION PROBLEMS [J].
GOPALAKRISHNAN, PS ;
KANEVSKY, D ;
NADAS, A ;
NAHAMOO, D .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1991, 37 (01) :107-113