PRICE: Software for the Targeted Assembly of Components of (Meta) Genomic Sequence Data

Cited by: 212
Authors
Ruby, J. Graham [1, 3]
Bellare, Priya [2, 3]
DeRisi, Joseph L. [1, 3]
Affiliations
[1] Univ Calif San Francisco, Dept Biochem & Biophys, San Francisco, CA 94044 USA
[2] Univ Calif San Francisco, GW Hooper Fdn Labs, San Francisco, CA 94044 USA
[3] Howard Hughes Med Inst, Chevy Chase, MD 20815 USA
Source
G3-GENES GENOMES GENETICS | 2013 / Vol. 3 / Issue 05
Keywords
KSHV; de novo genome assembly; high-throughput DNA sequencing; metagenomics; SARCOMA-ASSOCIATED HERPESVIRUS; DE-NOVO ASSEMBLER; SHORT READS; ALGORITHMS; PARALLEL; MILLIONS; SEARCH; VELVET; CELLS; IDBA;
DOI
10.1534/g3.113.005967
Chinese Library Classification (CLC) number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Low-cost DNA sequencing technologies have expanded the role for direct nucleic acid sequencing in the analysis of genomes, transcriptomes, and the metagenomes of whole ecosystems. Human and machine comprehension of such large datasets can be simplified via synthesis of sequence fragments into long, contiguous blocks of sequence (contigs), but most of the progress in the field of assembly has focused on genomes in isolation rather than metagenomes. Here, we present software for paired-read iterative contig extension (PRICE), a strategy for focused assembly of particular nucleic acid species using complex metagenomic data as input. We describe the assembly strategy implemented by PRICE and provide examples of its application to the sequence of particular genes, transcripts, and virus genomes from complex multicomponent datasets, including an assembly of the BCBL-1 strain of Kaposi's sarcoma-associated herpesvirus. PRICE is open-source and available for free download (derisilab.ucsf.edu/software/price/ or sourceforge.net/projects/pricedenovo/).
Pages: 865-880
Page count: 16