Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles

被引:147
作者
Gadala-Maria, Daniel [1 ]
Yaari, Gur [2 ,4 ]
Uduman, Mohamed [2 ]
Kleinstein, Steven H. [1 ,2 ,3 ]
机构
[1] Yale Univ, Interdept Program Computat Biol & Bioinformat, New Haven, CT 06511 USA
[2] Yale Univ, Sch Med, Dept Pathol, New Haven, CT 06511 USA
[3] Yale Univ, Sch Med, Dept Immunobiol, New Haven, CT 06511 USA
[4] Bar Ilan Univ, Fac Engn, Bioengn Program, IL-5290002 Ramat Gan, Israel
基金
美国国家卫生研究院;
关键词
next-generation sequencing; B-cell repertoire; adaptive immunity; somatic hypermutation; variable gene segment; SOMATIC HYPERMUTATION; IG; IDENTIFICATION; REPERTOIRE; SIGNATURES; DIVERSITY; SELECTION; SYSTEM;
D O I
10.1073/pnas.1417683112
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Individual variation in germline and expressed B-cell immunoglobulin (Ig) repertoires has been associated with aging, disease susceptibility, and differential response to infection and vaccination. Repertoire properties can now be studied at large-scale through next-generation sequencing of rearranged Ig genes. Accurate analysis of these repertoire-sequencing (Rep-Seq) data requires identifying the germline variable (V), diversity (D), and joining (J) gene segments used by each Ig sequence. Current V(D) J assignment methods work by aligning sequences to a database of known germline V(D) J segment alleles. However, existing databases are likely to be incomplete and novel polymorphisms are hard to differentiate from the frequent occurrence of somatic hypermutations in Ig sequences. Here we develop a Tool for Ig Genotype Elucidation via Rep-Seq (TIgGER). TIgGER analyzes mutation patterns in Rep-Seq data to identify novel V segment alleles, and also constructs a personalized germline database containing the specific set of alleles carried by a subject. This information is then used to improve the initial V segment assignments from existing tools, like IMGT/HighV-QUEST. The application of TIgGER to Rep-Seq data from seven subjects identified 11 novel V segment alleles, including at least one in every subject examined. These novel alleles constituted 13% of the total number of unique alleles in these subjects, and impacted 3% of V(D) J segment assignments. These results reinforce the highly polymorphic nature of human Ig V genes, and suggest that many novel alleles remain to be discovered. The integration of TIgGER into Rep-Seq processing pipelines will increase the accuracy of V segment assignments, thus improving B-cell repertoire analyses.
引用
收藏
页码:E862 / E870
页数:9
相关论文
共 35 条
[1]  
Alamyar E, 2010, 11 JOURN OUV BIOL IN
[2]   Analysis of immune-related loci identifies 48 new susceptibility variants for multiple sclerosis [J].
Beecham, Ashley H. ;
Patsopoulos, Nikolaos A. ;
Xifara, Dionysia K. ;
Davis, Mary F. ;
Kemppinen, Anu ;
Cotsapas, Chris ;
Shah, Tejas S. ;
Spencer, Chris ;
Booth, David ;
Goris, An ;
Oturai, Annette ;
Saarela, Janna ;
Fontaine, Bertrand ;
Hemmer, Bernhard ;
Martin, Claes ;
Zipp, Frauke ;
D'Alfonso, Sandra ;
Martinelli-Boneschi, Filippo ;
Taylor, Bruce ;
Harbo, Hanne F. ;
Kockum, Ingrid ;
Hillert, Jan ;
Olsson, Tomas ;
Ban, Maria ;
Oksenberg, Jorge R. ;
Hintzen, Rogier ;
Barcellos, Lisa F. ;
Agliardi, Cristina ;
Alfredsson, Lars ;
Alizadeh, Mehdi ;
Anderson, Carl ;
Andrews, Robert ;
Sondergaard, Helle Bach ;
Baker, Amie ;
Band, Gavin ;
Baranzini, Sergio E. ;
Barizzone, Nadia ;
Barrett, Jeffrey ;
Bellenguez, Celine ;
Bergamaschi, Laura ;
Bernardinelli, Luisa ;
Berthele, Achim ;
Biberacher, Viola ;
Binder, Thomas M. C. ;
Blackburn, Hannah ;
Bomfim, Izaura L. ;
Brambilla, Paola ;
Broadley, Simon ;
Brochet, Bruno ;
Brundin, Lou .
NATURE GENETICS, 2013, 45 (11) :1353-+
[3]   Rep-Seq: uncovering the immunological repertoire through next-generation sequencing [J].
Benichou, Jennifer ;
Ben-Hamo, Rotem ;
Louzoun, Yoram ;
Efroni, Sol .
IMMUNOLOGY, 2012, 135 (03) :183-191
[4]   Human lymphocyte repertoires in ageing [J].
Boyd, Scott D. ;
Liu, Yi ;
Wang, Chen ;
Martin, Victoria ;
Dunn-Walters, Deborah K. .
CURRENT OPINION IN IMMUNOLOGY, 2013, 25 (04) :511-515
[5]   Individual Variation in the Germline Ig Gene Repertoire Inferred from Variable Region Gene Rearrangements [J].
Boyd, Scott D. ;
Gaeta, Bruno A. ;
Jackson, Katherine J. ;
Fire, Andrew Z. ;
Marshall, Eleanor L. ;
Merker, Jason D. ;
Maniar, Jay M. ;
Zhang, Lyndon N. ;
Sahaf, Bita ;
Jones, Carol D. ;
Simen, Birgitte B. ;
Hanczaruk, Bozena ;
Nguyen, Khoa D. ;
Nadeau, Kari C. ;
Egholm, Michael ;
Miklos, David B. ;
Zehnder, James L. ;
Collins, Andrew M. .
JOURNAL OF IMMUNOLOGY, 2010, 184 (12) :6986-6992
[6]   Measurement and Clinical Monitoring of Human Lymphocyte Clonality by Massively Parallel V-D-J Pyrosequencing [J].
Boyd, Scott D. ;
Marshall, Eleanor L. ;
Merker, Jason D. ;
Maniar, Jay M. ;
Zhang, Lyndon N. ;
Sahaf, Bita ;
Jones, Carol D. ;
Simen, Birgitte B. ;
Hanczaruk, Bozena ;
Nguyen, Khoa D. ;
Nadeau, Kari C. ;
Egholm, Michael ;
Miklos, David B. ;
Zehnder, James L. ;
Fire, Andrew Z. .
SCIENCE TRANSLATIONAL MEDICINE, 2009, 1 (12)
[7]   IMGT/V-QUEST: the highly customized and integrated system for IG and TR standardized V-J and V-D-J sequence analysis [J].
Brochet, Xavier ;
Lefranc, Marie-Paule ;
Giudicelli, Veronique .
NUCLEIC ACIDS RESEARCH, 2008, 36 :W503-W508
[8]   Genetic susceptibility to SLE: Recent progress from GWAS [J].
Cui, Yong ;
Sheng, Yujun ;
Zhang, Xuejun .
JOURNAL OF AUTOIMMUNITY, 2013, 41 :25-33
[9]   Interrogating the major histocompatibility complex with high-throughput genomics [J].
de Bakker, Paul I. W. ;
Raychaudhuri, Soumya .
HUMAN MOLECULAR GENETICS, 2012, 21 :R29-R36
[10]   Personalized, sequencing-based immune profiling spurs startups [J].
Eisenstein, Michael .
NATURE BIOTECHNOLOGY, 2013, 31 (03) :184-186