Identification of errors introduced during high throughput sequencing of the T cell receptor repertoire

Cited by: 55
Authors
Nguyen, Phuong [1]
Ma, Jing [2]
Pei, Deqing [3]
Obert, Caroline [4]
Cheng, Cheng [3]
Geiger, Terrence L. [1]
Affiliations
[1] St Jude Childrens Hosp, Dept Pathol, Memphis, TN 38105 USA
[2] St Jude Childrens Hosp, Dept Informat Sci, Memphis, TN 38105 USA
[3] St Jude Childrens Hosp, Dept Biostat, Memphis, TN 38105 USA
[4] St Jude Childrens Hosp, Hartwell Ctr Bioinformat, Memphis, TN 38105 USA
Keywords
TCR REPERTOIRE; DIVERSITY; INFECTION; SELECTION; MOUSE;
DOI
10.1186/1471-2164-12-106
Chinese Library Classification number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology];
Subject classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Recent advances in massively parallel sequencing have increased the depth at which T cell receptor (TCR) repertoires can be probed by more than 3 log10, allowing for saturation sequencing of immune repertoires. The resolution of this sequencing is dependent on its accuracy, and direct assessments of the errors formed during high throughput repertoire analyses are limited.
Results: We analyzed 3 monoclonal TCR from TCR transgenic, Rag(-/-) mice using Illumina® sequencing. A total of 27 sequencing reactions were performed for each TCR using a trifurcating design in which samples were divided into 3 at significant processing junctures. More than 20 million complementarity determining region (CDR) 3 sequences were analyzed. Filtering for lower quality sequences diminished but did not eliminate sequence errors, which occurred within 1-6% of sequences. Erroneous sequences were predominantly of correct length and contained single nucleotide substitutions. Rates of specific substitutions varied dramatically in a position-dependent manner. Four substitutions, all purine-pyrimidine transversions, predominated. Solid phase amplification and sequencing, rather than liquid sample amplification and preparation, appeared to be the primary sources of error. Analysis of polyclonal repertoires demonstrated the impact of error accumulation on data parameters.
Conclusions: Caution is needed in interpreting repertoire data due to potential contamination with mis-sequenced reads. However, a high association of errors with phred score, high relatedness of erroneous sequences with the parental sequence, dominance of specific nucleotide substitutions, and a skewed ratio of forward to reverse reads among erroneous sequences indicate approaches to filter erroneous sequences from repertoire data sets.
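The abstract points to phred score, relatedness to the parental sequence, and purine-pyrimidine transversions as handles for filtering. The following is a minimal Python sketch of those three ideas, not the authors' pipeline. All function names and the example sequences are hypothetical; the only fixed facts it relies on are the standard phred definition (Q = -10 log10 of the error probability) and the purine (A, G) versus pyrimidine (C, T) classes.

```python
from collections import Counter

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def phred_to_error_prob(q):
    """Convert a phred quality score Q to an error probability, p = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def is_transversion(ref_nt, read_nt):
    """True when a substitution swaps a purine for a pyrimidine or vice versa."""
    return (ref_nt in PURINES and read_nt in PYRIMIDINES) or (
        ref_nt in PYRIMIDINES and read_nt in PURINES
    )


def substitutions(parent, read):
    """List (position, parent_nt, read_nt) mismatches; None if lengths differ."""
    if len(parent) != len(read):
        return None
    return [(i, p, r) for i, (p, r) in enumerate(zip(parent, read)) if p != r]


def classify_reads(parent, reads):
    """Count reads by their relatedness to a known (hypothetical) parental sequence."""
    counts = Counter()
    for read in reads:
        subs = substitutions(parent, read)
        if subs is None:
            counts["length_mismatch"] += 1
        elif not subs:
            counts["exact"] += 1
        elif len(subs) == 1:
            counts["single_substitution"] += 1
        else:
            counts["multiple_substitutions"] += 1
    return counts


# Illustrative input only: a made-up 4-nt "parental" sequence and four reads.
example = classify_reads("ACGT", ["ACGT", "ACGA", "AC", "TCGA"])
```

In a monoclonal sample, any read that is not an exact match can be treated as an error, so counts like these give a direct error estimate; in a polyclonal repertoire the same relatedness test would only flag candidates, since a true clone may differ from another by one nucleotide.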
Pages: 13
Related papers
27 in total
[1]   A direct estimate of the human αβ T cell receptor diversity [J].
Arstila, TP ;
Casrouge, A ;
Baron, V ;
Even, J ;
Kanellopoulos, J ;
Kourilsky, P .
SCIENCE, 1999, 286 (5441) :958-961
[2]   An RNA-polymerase mutant with reduced accuracy of chain elongation [J].
Blank, A ;
Gallant, JA ;
Burgess, RR ;
Loeb, LA .
BIOCHEMISTRY, 1986, 25 (20) :5920-5928
[3]   New perspectives for large-scale repertoire analysis of immune receptors [J].
Boudinot, Pierre ;
Marriotti-Ferrandiz, Maria Encarnita ;
Du Pasquier, Louis ;
Benmansour, Abdenour ;
Cazenave, Pierre-Andre ;
Six, Adrien .
MOLECULAR IMMUNOLOGY, 2008, 45 (09) :2437-2445
[4]   Size estimate of the αβ TCR repertoire of naive mouse splenocytes [J].
Casrouge, A ;
Beaudoing, E ;
Dalle, S ;
Pannetier, C ;
Kanellopoulos, J ;
Kourilsky, P .
JOURNAL OF IMMUNOLOGY, 2000, 164 (11) :5782-5787
[5]  
Currier Jeffrey R, 2001, Curr Protoc Immunol, VChapter 10, DOI 10.1002/0471142735.im1028s38
[6]   Rapid CD8+ T cell repertoire focusing and selection of high-affinity clones into memory following primary infection with a persistent human virus: Human Cytomegalovirus [J].
Day, Elizabeth K. ;
Carmichael, Andrew J. ;
Ten Berge, Ineke J. M. ;
Waller, Edward C. P. ;
Sissons, J. G. Patrick ;
Wills, Mark R. .
JOURNAL OF IMMUNOLOGY, 2007, 179 (05) :3203-3213
[7]   Base-calling of automated sequencer traces using phred. II. Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[8]   Profiling the T-cell receptor beta-chain repertoire by massively parallel sequencing [J].
Freeman, J. Douglas ;
Warren, Rene L. ;
Webb, John R. ;
Nelson, Brad H. ;
Holt, Robert A. .
GENOME RESEARCH, 2009, 19 (10) :1817-1824
[9]   Recognition of the peripheral self by naturally arising CD25+ CD4+ T cell receptors [J].
Hsieh, CS ;
Liang, YQ ;
Tyznik, AJ ;
Self, SG ;
Liggitt, D ;
Rudensky, AY .
IMMUNITY, 2004, 21 (02) :267-277
[10]  
Jouvin-Marche E, 2009, ADV EXP MED BIOL, V650, P82