pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires

被引:314
作者
Vander Heiden, Jason A. [1 ]
Yaari, Gur [2 ,3 ]
Uduman, Mohamed [1 ,3 ]
Stern, Joel N. H. [4 ,5 ,6 ]
O'Connor, Kevin C. [5 ,6 ]
Hafler, David A. [5 ,6 ]
Vigneault, Francois [7 ]
Kleinstein, Steven H. [1 ,3 ]
机构
[1] Yale Univ, Interdept Program Computat Biol & Bioinformat, New Haven, CT 06511 USA
[2] Bar Ilan Univ, Fac Engn, Bioengn Program, IL-52900 Ramat Gan, Israel
[3] Yale Univ, Sch Med, Dept Pathol, New Haven, CT 06511 USA
[4] Hofstra Univ, Hofstra N Shore LIJ Sch Med, Dept Sci Educ, Hempstead, NY 11530 USA
[5] Yale Univ, Sch Med, Dept Neurol, New Haven, CT 06511 USA
[6] Yale Univ, Sch Med, Human & Translat Immunol Program, New Haven, CT 06511 USA
[7] ABVitro Inc, Boston, MA 02210 USA
基金
美国国家卫生研究院;
关键词
IMMUNOGLOBULIN; INFERENCE; ALIGNMENT;
D O I
10.1093/bioinformatics/btu138
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Driven by dramatic technological improvements, large-scale characterization of lymphocyte receptor repertoires via high-throughput sequencing is now feasible. Although promising, the high germline and somatic diversity, especially of B-cell immunoglobulin repertoires, presents challenges for analysis requiring the development of specialized computational pipelines. We developed the REpertoire Sequencing TOolkit (pRESTO) for processing reads from high-throughput lymphocyte receptor studies. pRESTO processes raw sequences to produce error-corrected, sorted and annotated sequence sets, along with a wealth of metrics at each step. The toolkit supports multiplexed primer pools, single-or paired-end reads and emerging technologies that use single-molecule identifiers. pRESTO has been tested on data generated from Roche and Illumina platforms. It has a built-in capacity to parallelize the work between available processors and is able to efficiently process millions of sequences generated by typical high-throughput projects. Availability and implementation: pRESTO is freely available for academic use. The software package and detailed tutorials may be downloaded from http://clip.med.yale.edu/presto.
引用
收藏
页码:1930 / 1932
页数:3
相关论文
共 13 条
[1]  
Alamyar E., 2012, Immunome Res., V8, P26
[2]   IgTree©:: Creating immunoglobulin variable region gene lineage trees [J].
Barak, Michal ;
Zuckerman, Neta S. ;
Edelman, Hanna ;
Unger, Ron ;
Mehr, Ramit .
JOURNAL OF IMMUNOLOGICAL METHODS, 2008, 338 (1-2) :67-74
[3]   Rep-Seq: uncovering the immunological repertoire through next-generation sequencing [J].
Benichou, Jennifer ;
Ben-Hamo, Rotem ;
Louzoun, Yoram ;
Efroni, Sol .
IMMUNOLOGY, 2012, 135 (03) :183-191
[4]  
Chen Zhiliang, 2010, Immunome Res, V6 Suppl 1, pS4, DOI 10.1186/1745-7580-6-S1-S4
[5]   iHMMune-align:: hidden Markov model-based alignment and identification of germline genes in rearranged immunoglobulin gene sequences [J].
Gaeta, Bruno A. ;
Malming, Harald R. ;
Jackson, Katherine J. L. ;
Bain, Michael E. ;
Wilson, Patrick ;
Collins, Andrew M. .
BIOINFORMATICS, 2007, 23 (13) :1580-1587
[6]   The Inference of Phased Haplotypes for the Immunoglobulin H Chain V Region Gene Loci by Analysis of VDJ Gene Rearrangements [J].
Kidd, Marie J. ;
Chen, Zhiliang ;
Wang, Yan ;
Jackson, Katherine J. ;
Zhang, Lyndon ;
Boyd, Scott D. ;
Fire, Andrew Z. ;
Tanaka, Mark M. ;
Gaeta, Bruno A. ;
Collins, Andrew M. .
JOURNAL OF IMMUNOLOGY, 2012, 188 (03) :1333-1340
[7]   Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes [J].
Shiroguchi, Katsuyuki ;
Jia, Tony Z. ;
Sims, Peter A. ;
Xie, X. Sunney .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (04) :1347-1352
[8]   Decombinator: a tool for fast, efficient gene assignment in T-cell receptor sequences using a finite state machine [J].
Thomas, Niclas ;
Heather, James ;
Ndifon, Wilfred ;
Shawe-Taylor, John ;
Chain, Benjamin .
BIOINFORMATICS, 2013, 29 (05) :542-550
[9]   Genetic measurement of memory B-cell recall using antibody repertoire sequencing [J].
Vollmers, Christopher ;
Sit, Rene V. ;
Weinstein, Joshua A. ;
Dekker, Cornelia L. ;
Quake, Stephen R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (33) :13463-13468
[10]   SoDA: implementation of a 3D alignment algorithm for inference of antigen receptor recombinations [J].
Volpe, JM ;
Cowell, LG ;
Kepler, TB .
BIOINFORMATICS, 2006, 22 (04) :438-444