A jumping profile Hidden Markov Model and applications to recombination sites in HIV and HCV genomes

被引:75
作者
Schultz, Anne-Kathrin
Zhang, Ming
Leitner, Thomas
Kuiken, Carla
Korber, Bette
Morgenstern, Burkhard
Stanke, Mario
机构
[1] Univ Gottingen, Inst Microbiol & Genet, D-37077 Gottingen, Germany
[2] Los Alamos Natl Lab, Div Theoret, Los Alamos, NM 87545 USA
[3] Santa Fe Inst, Santa Fe, NM 87501 USA
关键词
D O I
10.1186/1471-2105-7-265
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Jumping alignments have recently been proposed as a strategy to search a given multiple sequence alignment A against a database. Instead of comparing a database sequence S to the multiple alignment or profile as a whole, S is compared and aligned to individual sequences from A. Within this alignment, S can jump between different sequences from A, so different parts of S can be aligned to different sequences from the input multiple alignment. This approach is particularly useful for dealing with recombination events. Results: We developed a jumping profile Hidden Markov Model (jpHMM), a probabilistic generalization of the jumping-alignment approach. Given a partition of the aligned input sequence family into known sequence subtypes, our model can jump between states corresponding to these different subtypes, depending on which subtype is locally most similar to a database sequence. Jumps between different subtypes are indicative of intersubtype recombinations. We applied our method to a large set of genome sequences from human immunodeficiency virus (HIV) and hepatitis C virus (HCV) as well as to simulated recombined genome sequences. Conclusion: Our results demonstrate that jumps in our jumping profile HMM often correspond to recombination breakpoints; our approach can therefore be used to detect recombinations in genomic sequences. The recombination breakpoints identified by jpHMM were found to be significantly more accurate than breakpoints defined by traditional methods based on comparing single representative sequences.
引用
收藏
页数:15
相关论文
共 44 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]   Evidence of intratypic recombination in natural populations of hepatitis C virus [J].
Colina, R ;
Casane, D ;
Vasquez, S ;
García-Aguirre, L ;
Chunga, A ;
Romero, C ;
Khan, B ;
Cristina, J .
JOURNAL OF GENERAL VIROLOGY, 2004, 85 :31-37
[3]  
Durbin R., 1998, BIOL SEQUENCE ANAL
[4]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[5]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365
[6]   Prevalence and changes in hepatitis C virus genotypes among multitransfused persons with hemophilia [J].
Eyster, ME ;
Sherman, KE ;
Goedert, JJ ;
Katsoulidou, A ;
Hatzakis, A .
JOURNAL OF INFECTIOUS DISEASES, 1999, 179 (05) :1062-1069
[7]   Gene recognition via spliced sequence alignment [J].
Gelfand, MS ;
Mironov, AA ;
Pevzner, PA .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (17) :9061-9066
[8]   Detection of HIV-1 subtypes, recombinants, and dual infections in east Africa by a multi-region hybridization assay [J].
Hoelscher, M ;
Dowling, WE ;
Sanders-Buell, E ;
Carr, JK ;
Harris, ME ;
Thomschke, A ;
Robb, ML ;
Birx, DL ;
McCutchan, FE .
AIDS, 2002, 16 (15) :2055-2064
[9]   Phylogenetic evidence for recombination in dengue virus [J].
Holmes, EC ;
Worobey, M ;
Rambaut, A .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (03) :405-409
[10]   A natural intergenotypic recombinant of hepatitis C virus identified in St. Petersburg [J].
Kalinina, O ;
Norder, H ;
Mukomolov, S ;
Magnius, LO .
JOURNAL OF VIROLOGY, 2002, 76 (08) :4034-4043