PipeCraft: Flexible open-source toolkit for bioinformatics analysis of custom high-throughput amplicon sequencing data

被引:126
作者
Anslan, Sten [1 ]
Bahram, Mohammad [1 ,2 ]
Hiiesalu, Indrek [1 ]
Tedersoo, Leho [3 ]
机构
[1] Univ Tartu, Inst Ecol & Earth Sci, Tartu, Estonia
[2] Uppsala Univ, Dept Organismal Biol, Evolutionary Biol Ctr, Uppsala, Sweden
[3] Univ Tartu, Nat Hist Museum, Tartu, Estonia
关键词
high-throughput sequencing; metabarcoding; pipeline; sequencing data analysis; software; SUBUNIT RIBOSOMAL-RNA; MOLECULAR-IDENTIFICATION; HYPERVARIABLE REGIONS; PIPELINE; COMMUNITIES; DIVERSITY; SOFTWARE; READS; FUNGI;
D O I
10.1111/1755-0998.12692
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput sequencing methods have become a routine analysis tool in environmental sciences as well as in public and private sector. These methods provide vast amount of data, which need to be analysed in several steps. Although the bioinformatics may be applied using several public tools, many analytical pipelines allow too few options for the optimal analysis for more complicated or customized designs. Here, we introduce PipeCraft, a flexible and handy bioinformatics pipeline with a user-friendly graphical interface that links several public tools for analysing amplicon sequencing data. Users are able to customize the pipeline by selecting the most suitable tools and options to process raw sequences from Illumina, Pacific Biosciences, Ion Torrent and Roche 454 sequencing platforms. We described the design and options of PipeCraft and evaluated its performance by analysing the data sets from three different sequencing platforms. We demonstrated that PipeCraft is able to process large data sets within 24hr. The graphical user interface and the automated links between various bioinformatics tools enable easy customization of the workflow. All analytical steps and options are recorded in log files and are easily traceable.
引用
收藏
页码:e234 / e240
页数:7
相关论文
共 52 条
[1]   PlutoF-a Web Based Workbench for Ecological and Taxonomic Research, with an Online Implementation for Fungal ITS Sequences [J].
Abarenkov, Kessy ;
Tedersoo, Leho ;
Nilsson, R. Henrik ;
Vellak, Kai ;
Saar, Irja ;
Veldre, Vilmar ;
Parmasto, Erast ;
Prous, Marko ;
Aan, Anne ;
Ots, Margus ;
Kurina, Olavi ;
Ostonen, Ivika ;
Jogeva, Janno ;
Halapuu, Siim ;
Poldmaa, Kadri ;
Toots, Maert ;
Truu, Jaak ;
Larsson, Karl-Henrik ;
Koljalg, Urmas .
EVOLUTIONARY BIOINFORMATICS, 2010, 6 :189-196
[2]   The UNITE database for molecular identification of fungi - recent updates and future perspectives [J].
Abarenkov, Kessy ;
Nilsson, R. Henrik ;
Larsson, Karl-Henrik ;
Alexander, Ian J. ;
Eberhardt, Ursula ;
Erland, Susanne ;
Hoiland, Klaus ;
Kjoller, Rasmus ;
Larsson, Ellen ;
Pennanen, Taina ;
Sen, Robin ;
Taylor, Andy F. S. ;
Tedersoo, Leho ;
Ursing, Bjorn M. ;
Vralstad, Trude ;
Liimatainen, Kare ;
Peintner, Ursula ;
Koljalg, Urmas .
NEW PHYTOLOGIST, 2010, 186 (02) :281-285
[3]   MICCA: a complete and accurate software for taxonomic profiling of metagenomic data [J].
Albanese, Davide ;
Fontana, Paolo ;
De Filippo, Carlotta ;
Cavalieri, Duccio ;
Donati, Claudio .
SCIENTIFIC REPORTS, 2015, 5
[4]   A Method for Studying Protistan Diversity Using Massively Parallel Sequencing of V9 Hypervariable Regions of Small-Subunit Ribosomal RNA Genes [J].
Amaral-Zettler, Linda A. ;
McCliment, Elizabeth A. ;
Ducklow, Hugh W. ;
Huse, Susan M. .
PLOS ONE, 2009, 4 (07)
[5]   Millions of reads, thousands of taxa: microbial community structure and associations analyzed via marker genes [J].
Balint, Miklos ;
Bahram, Mohammad ;
Eren, A. Murat ;
Faust, Karoline ;
Fuhrman, Jed A. ;
Lindahl, Bjorn ;
O'Hara, Robert B. ;
Opik, Maarja ;
Sogin, Mitchell L. ;
Unterseher, Martin ;
Tedersoo, Leho .
FEMS MICROBIOLOGY REVIEWS, 2016, 40 (05) :686-700
[6]   An Illumina metabarcoding pipeline for fungi [J].
Balint, Miklos ;
Schmidt, Philipp-Andre ;
Sharma, Rahul ;
Thines, Marco ;
Schmitt, Imke .
ECOLOGY AND EVOLUTION, 2014, 4 (13) :2642-2653
[7]   metaxa2: improved identification and taxonomic classification of small and large subunit rRNA in metagenomic data [J].
Bengtsson-Palme, Johan ;
Hartmann, Martin ;
Eriksson, Karl Martin ;
Pal, Chandan ;
Thorell, Kaisa ;
Larsson, Dan Goran Joakim ;
Nilsson, Rolf Henrik .
MOLECULAR ECOLOGY RESOURCES, 2015, 15 (06) :1403-1414
[8]   Improved software detection and extraction of ITS1 and ITS2 from ribosomal ITS sequences of fungi and other eukaryotes for analysis of environmental sequencing data [J].
Bengtsson-Palme, Johan ;
Ryberg, Martin ;
Hartmann, Martin ;
Branco, Sara ;
Wang, Zheng ;
Godhe, Anna ;
De Wit, Pierre ;
Sanchez-Garcia, Marisol ;
Ebersberger, Ingo ;
de Sousa, Filipe ;
Amend, Anthony S. ;
Jumpponen, Ari ;
Unterseher, Martin ;
Kristiansson, Erik ;
Abarenkov, Kessy ;
Bertrand, Yann J. K. ;
Sanli, Kemal ;
Eriksson, K. Martin ;
Vik, Unni ;
Veldre, Vilmar ;
Nilsson, R. Henrik .
METHODS IN ECOLOGY AND EVOLUTION, 2013, 4 (10) :914-919
[9]   OBITOOLS: a UNIX-inspired software package for DNA metabarcoding [J].
Boyer, Frederic ;
Mercier, Celine ;
Bonin, Aurelie ;
Le Bras, Yvan ;
Taberlet, Pierre ;
Coissac, Eric .
MOLECULAR ECOLOGY RESOURCES, 2016, 16 (01) :176-182
[10]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10