TransPi - a comprehensive TRanscriptome ANalysiS PIpeline for de novo transcriptome assembly

Cited by: 21
Authors
Rivera-Vicens, Ramon E. [1]
Garcia-Escudero, Catalina A. [1, 2]
Conci, Nicola [1]
Eitel, Michael [1]
Woerheide, Gert [1, 3, 4]
Affiliations
[1] Ludwig Maximilians Univ Munchen, Dept Earth & Environm Sci Paleontol & Geobiol, Munich, Germany
[2] Ludwig Maximilians Univ Munchen, Fac Biol, Grad Sch Evolut Ecol & Systemat, Planegg Martinsried, Germany
[3] Ludwig Maximilians Univ Munchen, GeoBio Ctr, Munich, Germany
[4] SNSB Bayer Staatssammlung Palaontol & Geol, Munich, Germany
Funding
European Union Horizon 2020
Keywords
annotation; assembly; de novo; Nextflow; nonmodel; pipeline; RNA-Seq; transcriptome; QUALITY ASSESSMENT; GENERATION; RECONSTRUCTION; ANNOTATION; ALIGNMENT;
DOI
10.1111/1755-0998.13593
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The use of RNA sequencing (RNA-Seq) data and the generation of de novo transcriptome assemblies have been pivotal for studies in ecology and evolution. This is especially true for nonmodel organisms, where no genome information is available. In such organisms, studies of differential gene expression, DNA enrichment bait design and phylogenetics can all be accomplished with de novo transcriptome assemblies. Multiple tools are available for transcriptome assembly, but no single tool can provide the best assembly for all data sets. Therefore, a multi-assembler approach, followed by a reduction step, is often sought to generate an improved representation of the assembly. To reduce errors in these complex analyses while at the same time attaining reproducibility and scalability, automated workflows have been essential in the analysis of RNA-Seq data. However, most of these tools are designed for species where genome data are used as reference for the assembly process, limiting their use in nonmodel organisms. We present TransPi, a comprehensive pipeline for de novo transcriptome assembly, with minimum user input but without losing the ability of a thorough analysis. A combination of different model organisms, k-mer sets, read lengths and read quantities was used for assessing the tool. Furthermore, a total of 49 nonmodel organisms, spanning different phyla, were also analysed. Compared to approaches using single assemblers only, TransPi produces higher BUSCO completeness percentages, and a concurrent significant reduction in duplication rates. TransPi is easy to configure and can be deployed seamlessly using Conda, Docker and Singularity.
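The "multi-assembler approach, followed by a reduction step" mentioned in the abstract can be illustrated with a minimal sketch. It is not TransPi's actual reduction method, which the abstract does not describe. It only assumes a deliberately naive rule: pool the transcripts from all assemblers, then drop any sequence that is identical to, or fully contained in, a longer one (on either strand). The names `reduce_assemblies` and `reverse_complement`, and the assembler labels in the usage note, are hypothetical.

```python
from __future__ import annotations


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    complement = str.maketrans("ACGTN", "TGCAN")
    return seq.translate(complement)[::-1]


def reduce_assemblies(assemblies: dict[str, list[str]]) -> list[str]:
    """Pool transcripts from several assemblers and remove redundancy.

    Assumption (illustrative only): a transcript is redundant if it, or
    its reverse complement, is a substring of an already kept transcript.
    """
    # Pool everything; the set removes exact duplicates across assemblers.
    pooled = {seq.upper() for seqs in assemblies.values() for seq in seqs}

    # Visit longest sequences first so shorter fragments are compared
    # against them; ties are broken alphabetically for determinism.
    ordered = sorted(pooled, key=lambda s: (-len(s), s))

    kept: list[str] = []
    for seq in ordered:
        rc = reverse_complement(seq)
        if any(seq in longer or rc in longer for longer in kept):
            continue  # contained in a longer transcript: drop it
        kept.append(seq)
    return kept
```

Calling `reduce_assemblies({"assembler_a": [...], "assembler_b": [...]})` with lists of transcript strings returns the non-redundant set, longest first. The substring check is quadratic in the number of transcripts, so this is only meant to convey the idea; real reduction tools rely on clustering or evidence-based selection instead.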
Pages: 2070-2086
Page count: 17