TransPi-a comprehensive TRanscriptome ANalysiS PIpeline for de novo transcriptome assembly

被引:21
|
作者
Rivera-Vicens, Ramon E. [1 ]
Garcia-Escudero, Catalina A. [1 ,2 ]
Conci, Nicola [1 ]
Eitel, Michael [1 ]
Woerheide, Gert [1 ,3 ,4 ]
机构
[1] Ludwig Maximilians Univ Munchen, Dept Earth & Environm Sci Paleontol & Geobiol, Munich, Germany
[2] Ludwig Maximilians Univ Munchen, Fac Biol, Grad Sch Evolut Ecol & Systemat, Planegg Martinsried, Germany
[3] Ludwig Maximilians Univ Munchen, GeoBlo Ctr, Munich, Germany
[4] SNSB Bayer Staatssammlung Palaontol & Geol, Munich, Germany
基金
欧盟地平线“2020”;
关键词
annotation; assembly; de novo; Nextflow; nonmodel; pipeline; RNA-Seq; transcriptome; QUALITY ASSESSMENT; GENERATION; RECONSTRUCTION; ANNOTATION; ALIGNMENT;
D O I
10.1111/1755-0998.13593
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The use of RNA sequencing (RNA-Seq) data and the generation of de novo transcriptome assemblies have been pivotal for studies in ecology and evolution. This is especially true for nonmodel organisms, where no genome information is available. In such organisms, studies of differential gene expression, DNA enrichment bait design and phylogenetics can all be accomplished with de novo transcriptome assemblies. Multiple tools are available for transcriptome assembly, but no single tool can provide the best assembly for all data sets. Therefore, a multi-assembler approach, followed by a reduction step, is often sought to generate an improved representation of the assembly. To reduce errors in these complex analyses while at the same time attaining reproducibility and scalability, automated workflows have been essential in the analysis of RNA-Seq data. However, most of these tools are designed for species where genome data are used as reference for the assembly process, limiting their use in nonmodel organisms. We present TransPi, a comprehensive pipeline for de novo transcriptome assembly, with minimum user input but without losing the ability of a thorough analysis. A combination of different model organisms, k-mer sets, read lengths and read quantities was used for assessing the tool. Furthermore, a total of 49 nonmodel organisms, spanning different phyla, were also analysed. Compared to approaches using single assemblers only, TransPi produces higher BUSCO completeness percentages, and a concurrent significant reduction in duplication rates. TransPi is easy to configure and can be deployed seamlessly using Conda, Docker and Singularity.
引用
收藏
页码:2070 / 2086
页数:17
相关论文
共 50 条
  • [21] De Novo Assembly and Analysis of Polygonatum sibiricum Transcriptome and Identification of Genes Involved in Polysaccharide Biosynthesis
    Wang, Shiqiang
    Wang, Bin
    Hua, Wenping
    Niu, Junfeng
    Dang, Kaikai
    Qiang, Yi
    Wang, Zhezhi
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2017, 18 (09)
  • [22] De novo sequencing and assembly of Azadirachta indica fruit transcriptome
    Krishnan, Neeraja M.
    Pattnaik, Swetansu
    Deepak, S. A.
    Hariharan, Arun K.
    Gaur, Prakhar
    Chaudhary, Rakshit
    Jain, Prachi
    Vaidyanathan, Srividya
    Krishna, P. G. Bharath
    Panda, Binay
    CURRENT SCIENCE, 2011, 101 (12): : 1553 - 1561
  • [23] De Novo Transcriptome Assembly of Pummelo and Molecular Marker Development
    Liang, Mei
    Yang, Xiaoming
    Li, Hang
    Su, Shiying
    Yi, Hualin
    Chai, Lijun
    Deng, Xiuxin
    PLOS ONE, 2015, 10 (03):
  • [24] De novo assembly and annotation of the whole transcriptome of Muraenesox cinereus
    Shan, Binbin
    Liu, Yan
    Yang, Changping
    Wang, Liangming
    Li, Yuan
    Sun, Dianrong
    MARINE GENOMICS, 2022, 61
  • [25] De Novo Assembly and Characterization of the Transcriptome of Grasshopper Shirakiacris shirakii
    Qiu, Zhongying
    Liu, Fei
    Lu, Huimeng
    Yuan, Hao
    Zhang, Qin
    Huang, Yuan
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (07)
  • [26] Evolutionary insights from de novo transcriptome assembly and SNP discovery in California white oaks
    Shawn J. Cokus
    Paul F. Gugger
    Victoria L. Sork
    BMC Genomics, 16
  • [27] Sequencing, de novo assembly and comparative analysis of Raphanus sativus transcriptome
    Wu, Gang
    Zhang, Libin
    Yin, Yongtai
    Wu, Jiangsheng
    Yu, Longjiang
    Zhou, Yanhong
    Li, Maoteng
    FRONTIERS IN PLANT SCIENCE, 2015, 6
  • [28] De Novo Transcriptome Assembly and Analysis of the Flat Oyster Pathogenic Protozoa Bonamia Ostreae
    Chevignon, Germain
    Dotto-Maurel, Aurelie
    Serpin, Delphine
    Chollet, Bruno
    Arzul, Isabelle
    FRONTIERS IN CELLULAR AND INFECTION MICROBIOLOGY, 2022, 12
  • [29] De Novo Assembly, Characterization and Comparative Transcriptome Analysis of the Mature Gonads in Spinibarbus hollandi
    Han, Chong
    Huang, Wenwei
    Peng, Suhan
    Zhou, Jiangwei
    Zhan, Huawei
    Zhang, Yuying
    Li, Wenjun
    Gong, Jian
    Li, Qiang
    ANIMALS, 2023, 13 (01):
  • [30] Effect of de novo transcriptome assembly on transcript quantification
    Hsieh, Ping-Han
    Oyang, Yen-Jen
    Chen, Chien-Yu
    SCIENTIFIC REPORTS, 2019, 9 (1)