SITEBLAST - rapid and sensitive local alignment of genomic sequences employing motif anchors

被引:12
作者
Michael, M [1 ]
Dieterich, C [1 ]
Vingron, M [1 ]
机构
[1] Max Planck Inst Mol Genet, D-14195 Berlin, Germany
关键词
D O I
10.1093/bioinformatics/bti224
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Comparative sequence analysis is the essence of many approaches to genome annotation. Heuristic alignment algorithms utilize similar seed pairs to anchor an alignment. Some applications of local alignment algorithms (e.g. phylogenetic footprinting) would benefit from including prior knowledge (e.g. binding site motifs) in the alignment building process. Results: We introduce predefined sequence patterns as anchor points into a heuristic local alignment strategy. We extended the BLASTZ program for this purpose. A set of seed patterns is either given as consensus sequences in IUPAC code or position-weight-matrices. Phylogenetic footprinting of promoter regions is one of many potential applications for the SITEBLAST software.
引用
收藏
页码:2093 / 2094
页数:2
相关论文
共 8 条
[1]   EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH [J].
AHO, AV ;
CORASICK, MJ .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :333-340
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Berezikov E, 2004, GENOME RES, V14, P170
[4]   CORG: a database for COmparative Regulatory Genomics [J].
Dieterich, C ;
Wang, H ;
Rateitschak, K ;
Luz, H ;
Vingron, M .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :55-57
[5]   Identifying DNA and protein patterns with statistically significant alignments of multiple sequences [J].
Hertz, GZ ;
Stormo, GD .
BIOINFORMATICS, 1999, 15 (7-8) :563-577
[6]  
Rahmann S, 2003, STAT APPL GENET MOL, V2, DOI [DOI 10.2202/1544-6115.1032, 10.2202/1544-6115.1032]
[7]   Human-mouse alignments with BLASTZ [J].
Schwartz, S ;
Kent, WJ ;
Smit, A ;
Zhang, Z ;
Baertsch, R ;
Hardison, RC ;
Haussler, D ;
Miller, W .
GENOME RESEARCH, 2003, 13 (01) :103-107