Human-mouse alignments with BLASTZ

被引:896
作者
Schwartz, S
Kent, WJ
Smit, A
Zhang, Z
Baertsch, R
Hardison, RC
Haussler, D
Miller, W [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
[2] Univ Calif Santa Cruz, Ctr Biomol Sci & Engn, Santa Cruz, CA 95064 USA
[3] Inst Syst Biol, Seattle, WA 98103 USA
[4] Paracel Inc, Pasadena, CA 91106 USA
[5] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
[6] Univ Calif Santa Cruz, Howard Hughes Med Inst, Santa Cruz, CA 95064 USA
关键词
D O I
10.1101/gr.809403
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The Mouse Genome Analysis Consortium aligned the human and mouse genome sequences for a variety of purposes, using alignment programs that suited the various needs. For investigating issues regarding genome evolution, a particularly sensitive method was needed to permit alignment of a large proportion of the neutrally evolving regions. We selected a program called BLASTZ, an independent implementation of the Gapped BLAST algorithm specifically designed for aligning two long genomic sequences. BLASTZ was subsequently modified, both to attain efficiency adequate for aligning entire mammalian genomes and to increase its sensitivity. This work describes BLASTZ, its modifications, the hardware environment on which we run it, and several empirical studies to validate its results.
引用
收藏
页码:103 / 107
页数:5
相关论文
共 16 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Chiaromonte F, 2002, Pac Symp Biocomput, P115
[3]  
ELNITSKI L, 2003, GENOME RES
[4]  
HARDISON RC, 2003, GENOME RES
[5]  
HUANG X, 1994, SPRINGER LECT NOTES, V807, P87
[6]   The human genome browser at UCSC [J].
Kent, WJ ;
Sugnet, CW ;
Furey, TS ;
Roskin, KM ;
Pringle, TH ;
Zahler, AM ;
Haussler, D .
GENOME RESEARCH, 2002, 12 (06) :996-1006
[7]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[8]   Complete genomic sequence and analysis of the prion protein gene region from three mammalian species [J].
Lee, IY ;
Westaway, D ;
Smit, AFA ;
Wang, K ;
Seto, J ;
Chen, L ;
Acharya, C ;
Ankener, M ;
Baskin, D ;
Cooper, C ;
Yao, H ;
Prusiner, SB ;
Hood, LE .
GENOME RESEARCH, 1998, 8 (10) :1022-1037
[9]   PatternHunter: faster and more sensitive homology search [J].
Ma, B ;
Tromp, J ;
Li, M .
BIOINFORMATICS, 2002, 18 (03) :440-445
[10]   Comparison of genomic DNA sequences: solved and unsolved problems [J].
Miller, W .
BIOINFORMATICS, 2001, 17 (05) :391-397