PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data

被引:118
作者
Anslan, Sten [1 ]
Bahram, Mohammad [1 ,2 ]
Hiiesalu, Indrek [1 ]
Tedersoo, Leho [3 ]
机构
[1] Univ Tartu, Inst Ecol & Earth Sci, Tartu, Estonia
[2] Uppsala Univ, Dept Organismal Biol, Evolutionary Biol Ctr, Uppsala, Sweden
[3] Univ Tartu, Nat Hist Museum, Tartu, Estonia
关键词
high-throughput sequencing; metabarcoding; pipeline; sequencing data analysis; software; SUBUNIT RIBOSOMAL-RNA; MOLECULAR-IDENTIFICATION; HYPERVARIABLE REGIONS; PIPELINE; COMMUNITIES; DIVERSITY; SOFTWARE; READS; FUNGI;
D O I
10.1111/1755-0998.12692
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.
引用
收藏
页码:e234 / e240
页数:7
相关论文
共 47 条
  • [21] A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection
    Goodacre, Norman
    Aljanahi, Aisha
    Nandakumar, Subhiksha
    Mikailov, Mike
    Khan, Arifa S.
    MSPHERE, 2018, 3 (02)
  • [22] Tools and best practices for retrotransposon analysis using high-throughput sequencing data
    Aurélie Teissandier
    Nicolas Servant
    Emmanuel Barillot
    Deborah Bourc’his
    Mobile DNA, 10
  • [23] Identification and characterization of microRNAs related to salt stress in broccoli, using high-throughput sequencing and bioinformatics analysis
    Yunhong Tian
    Yunming Tian
    Xiaojun Luo
    Tao Zhou
    Zuoping Huang
    Ying Liu
    Yihan Qiu
    Bing Hou
    Dan Sun
    Hongyu Deng
    Shen Qian
    Kaitai Yao
    BMC Plant Biology, 14
  • [24] Tools and best practices for retrotransposon analysis using high-throughput sequencing data
    Teissandier, Aurelie
    Servant, Nicolas
    Barillot, Emmanuel
    Bourc'his, Deborah
    MOBILE DNA, 2019, 10 (01)
  • [25] Identification of Small RNAs Associated with Salt Stress in Chrysanthemums through High-Throughput Sequencing and Bioinformatics Analysis
    Nai, Jiefei
    Ma, Tieming
    Liu, Yingjie
    Zhou, Yunwei
    GENES, 2023, 14 (03)
  • [26] Identification and characterization of microRNAs related to salt stress in broccoli, using high-throughput sequencing and bioinformatics analysis
    Tian, Yunhong
    Tian, Yunming
    Luo, Xiaojun
    Zhou, Tao
    Huang, Zuoping
    Liu, Ying
    Qiu, Yihan
    Hou, Bing
    Sun, Dan
    Deng, Hongyu
    Qian, Shen
    Yao, Kaitai
    BMC PLANT BIOLOGY, 2014, 14
  • [27] Systematic Analysis of the Association between Gut Flora and Obesity through High-Throughput Sequencing and Bioinformatics Approaches
    Chiu, Chih-Min
    Huang, Wei-Chih
    Weng, Shun-Long
    Tseng, Han-Chi
    Liang, Chao
    Wang, Wei-Chi
    Yang, Ting
    Yang, Tzu-Ling
    Weng, Chen-Tsung
    Chang, Tzu-Hao
    Huang, Hsien-Da
    BIOMED RESEARCH INTERNATIONAL, 2014, 2014
  • [28] Statistical and Computational Methods for High-Throughput Sequencing Data Analysis of Alternative Splicing
    Chen L.
    Statistics in Biosciences, 2013, 5 (1) : 138 - 155
  • [29] SaDA: From Sampling to Data Analysis-An Extensible Open Source Infrastructure for Rapid, Robust and Automated Management and Analysis of Modern Ecological High-Throughput Microarray Data
    Singh, Kumar Saurabh
    Thual, Dominique
    Spurio, Roberto
    Cannata, Nicola
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2015, 12 (06) : 6352 - 6366
  • [30] High-throughput Sequencing and Bioinformatics Analysis Reveals the Neurogenesis Key Targets of Curcumin Action in Mouse Brain with MCAO
    Li, Litao
    Cheng, Jinming
    Ji, Yingxiao
    Liu, Jihong
    Zhai, Rui
    Wang, Hebo
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2023, 26 (06) : 1233 - 1241