Autoregressive Higher-Order Hidden Markov Models: Exploiting Local Chromosomal Dependencies in the Analysis of Tumor Expression Profiles

被引:19
作者
Seifert, Michael [1 ]
Abou-El-Ardat, Khalil [2 ]
Friedrich, Betty [1 ]
Klink, Barbara [2 ]
Deutsch, Andreas [1 ]
机构
[1] Tech Univ Dresden, Ctr Informat Serv & High Performance Comp, Dresden, Germany
[2] Tech Univ Dresden, Fac Med Carl Gustav Carus, Inst Clin Genet, Dresden, Germany
关键词
COPY-NUMBER ALTERATION; ARRAY CGH; ANALYSIS REVEALS; CANCER; PATTERNS; SUBTYPES;
D O I
10.1371/journal.pone.0100295
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Changes in gene expression programs play a central role in cancer. Chromosomal aberrations such as deletions, duplications and translocations of DNA segments can lead to highly significant positive correlations of gene expression levels of neighboring genes. This should be utilized to improve the analysis of tumor expression profiles. Here, we develop a novel model class of autoregressive higher-order Hidden Markov Models (HMMs) that carefully exploit local data-dependent chromosomal dependencies to improve the identification of differentially expressed genes in tumor. Autoregressive higher-order HMMs overcome generally existing limitations of standard first-order HMMs in the modeling of dependencies between genes in close chromosomal proximity by the simultaneous usage of higher-order state-transitions and autoregressive emissions as novel model features. We apply autoregressive higher-order HMMs to the analysis of breast cancer and glioma gene expression data and perform in-depth model evaluation studies. We find that autoregressive higher-order HMMs clearly improve the identification of overexpressed genes with underlying gene copy number duplications in breast cancer in comparison to mixture models, standard first-and higher-order HMMs, and other related methods. The performance benefit is attributed to the simultaneous usage of higher-order state-transitions in combination with autoregressive emissions. This benefit could not be reached by using each of these two features independently. We also find that autoregressive higher-order HMMs are better able to identify differentially expressed genes in tumors independent of the underlying gene copy number status in comparison to the majority of related methods. This is further supported by the identification of well-known and of previously unreported hotspots of differential expression in glioblastomas demonstrating the efficacy of autoregressive higher-order HMMs for the analysis of individual tumor expression profiles. Moreover, we reveal interesting novel details of systematic alterations of gene expression levels in known cancer signaling pathways distinguishing oligodendrogliomas, astrocytomas and glioblastomas. An implementation is available under www.jstacs.de/index.php/ARHMM.
引用
收藏
页数:15
相关论文
共 64 条
[51]  
Seifert M., 2010, THESIS M LUTHER U HA
[52]   MeDIP-HMM: genome-wide identification of distinct DNA methylation states from high-density tiling arrays [J].
Seifert, Michael ;
Cortijo, Sandra ;
Colome-Tatche, Maria ;
Johannes, Frank ;
Roudier, Francois ;
Colot, Vincent .
BIOINFORMATICS, 2012, 28 (22) :2930-2939
[53]   Parsimonious Higher-Order Hidden Markov Models for Improved Array-CGH Analysis with Applications to Arabidopsis thaliana [J].
Seifert, Michael ;
Gohr, Andre ;
Strickert, Marc ;
Grosse, Ivo .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (01)
[54]   Exploiting prior knowledge and gene distances in the analysis of tumor expression profiles with extended Hidden Markov Models [J].
Seifert, Michael ;
Strickert, Marc ;
Schliep, Alexander ;
Grosse, Ivo .
BIOINFORMATICS, 2011, 27 (12) :1645-1652
[55]   Gene expression profiling identifies molecular subtypes of gliomas [J].
Shai, R ;
Shi, T ;
Kremen, TJ ;
Horvath, S ;
Liau, LM ;
Cloughesy, TF ;
Mischel, PS ;
Nelson, SF .
ONCOGENE, 2003, 22 (31) :4918-4923
[56]  
Shannon M, 2010, 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, P829
[57]   Taking time seriously: Hidden Markov experts applied to financial engineering [J].
Shi, SM ;
Weigend, AS .
PROCEEDINGS OF THE IEEE/IAFE 1997 COMPUTATIONAL INTELLIGENCE FOR FINANCIAL ENGINEERING (CIFER), 1997, :244-252
[58]   The functional role of Notch signaling in human gliomas [J].
Stockhausen, Marie-Therese ;
Kristoffersen, Karina ;
Poulsen, Hans Skovgaard .
NEURO-ONCOLOGY, 2010, 12 (02) :199-211
[59]   Extracting information from spot interest rates and credit ratings using double higher-order hidden Markov models [J].
Siu T.-K. ;
Ching W.-K. ;
Fung E.S. ;
Ng M.K. .
Computational Economics, 2005, 26 (3-4) :69-102
[60]   MACAT - microarray chromosome analysis tool [J].
Toedling, J ;
Schmeier, S ;
Heinig, M ;
Georgi, B ;
Roepcke, S .
BIOINFORMATICS, 2005, 21 (09) :2112-2113