SATRAP: SOLiD Assembler TRAnslation Program

Cited by: 3
Authors
Campagna, Davide [1, 2]
Gasparini, Fabio [2 ]
Franchi, Nicola [2 ]
Manni, Lucia [2 ]
Telatin, Andrea [1 ]
Vitulo, Nicola [2 ]
Ballarin, Loriano [2 ]
Valle, Giorgio [1, 2]
Affiliations
[1] Univ Padua, CRIBI Biotechnol Ctr, Padua, Italy
[2] Univ Padua, Dept Biol, Padua, Italy
Keywords
SEQUENCES; GENOME
DOI
10.1371/journal.pone.0137436
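Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Subject classification codes
07; 0710; 09
Abstract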
SOLiD DNA sequences are typically analyzed using a reference genome, while they are not recommended for de novo assembly of genomes or transcriptomes. This is mainly due to the difficulty in translating the SOLiD color-space data into normal base-space sequences. In fact, the nature of color-space is such that any misinterpreted color leads to a chain of further translation errors, producing totally wrong results. Here we describe SATRAP, a computer program designed to efficiently translate de novo assembled color-space sequences into a base-space format. The program was tested and validated using simulated and real transcriptomic data; its modularity allows an easy integration into more complex pipelines, such as Oases for RNA-seq de novo assembly. SATRAP is available at http://satrap.cribi.unipd.it, either as a multi-step pipeline incorporating several tools for RNA-seq assembly or as an individual module for use with the Oases package.
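The error propagation described in the abstract can be illustrated with a minimal Python sketch of the standard SOLiD two-base encoding, in which each color is the XOR of the 2-bit codes of adjacent bases (A=0, C=1, G=2, T=3). This is only an illustration of why a single misread color corrupts every downstream base; it is not SATRAP's translation algorithm, and the function names `encode` and `decode` and the example sequence are assumptions made for this sketch.

```python
BASES = "ACGT"  # 2-bit codes: A=0, C=1, G=2, T=3


def encode(seq):
    """Return (first base, list of colors) for a base-space sequence."""
    idx = [BASES.index(b) for b in seq]
    # Each color is the XOR of the codes of two adjacent bases.
    colors = [a ^ b for a, b in zip(idx, idx[1:])]
    return seq[0], colors


def decode(first_base, colors):
    """Naively translate colors back to bases, starting from a known base."""
    cur = BASES.index(first_base)
    out = [first_base]
    for c in colors:
        cur ^= c  # the next base depends on every previous color
        out.append(BASES[cur])
    return "".join(out)


original = "ATGGCATTCA"  # hypothetical example sequence
first, colors = encode(original)
assert decode(first, colors) == original  # error-free round trip

# Simulate one misread color at index 2 of the color string.
bad = list(colors)
bad[2] = (bad[2] + 1) % 4
wrong = decode(first, bad)
mismatches = sum(a != b for a, b in zip(original, wrong))

print(original)    # ATGGCATTCA
print(wrong)       # ATGTACGGAC
print(mismatches)  # 7
```

In this sketch one wrong color out of nine changes all seven bases that follow it, which is the chain of translation errors the abstract refers to.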
Pages: 7