Sim4db and Leaff: utilities for fast batch spliced alignment and sequence indexing

Cited by: 16
Authors
Walenz, Brian [2]
Florea, Liliana [1]
Affiliations
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] J Craig Venter Inst, Rockville, MD 20850 USA
Funding
National Institutes of Health (USA)
Keywords
PROGRAM
DOI
10.1093/bioinformatics/btr285
Chinese Library Classification
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
The large number of genomes that will be sequenced will need to be annotated with genes and other functional features. Aligning gene sequences from a related species to the target genome is an economical and highly reliable method to identify genes; unfortunately, existing tools have been lacking in sensitivity and speed. A program we reported, sim4cc, was shown to be highly accurate but is limited to comparing one cDNA with one genomic sequence. We present here an optimization of the tool, implemented in the packages sim4db and leaff. The new tool performs batch alignments of cDNA and genomic sequences in a fraction of the time required by its predecessor, and thus is very well suited for genome-wide analyses.
Pages: 1869-1870
Page count: 2