Hierarchical Clustering Can Identify B Cell Clones with High Confidence in Ig Repertoire Sequencing Data

被引:92
作者
Gupta, Namita T. [1 ]
Adams, Kristofor D. [2 ]
Briggs, Adrian W. [2 ]
Timberlake, Sonia C. [2 ]
Vigneault, Francois [2 ]
Kleinstein, Steven H. [1 ,3 ,4 ]
机构
[1] Yale Univ, Interdept Program Computat Biol & Bioinfomat, New Haven, CT 06520 USA
[2] AbVitro, Boston, MA 02210 USA
[3] Yale Sch Med, Dept Immunol, New Haven, CT 06520 USA
[4] Yale Sch Med, Dept Pathol, New Haven, CT 06520 USA
基金
美国国家卫生研究院;
关键词
HIV-1-NEUTRALIZING ANTIBODIES; IMMUNOGLOBULIN; GENERATION; IDENTIFICATION; MATURATION; DIVERSITY; INFECTION; RITUXIMAB; SELECTION; TOOLKIT;
D O I
10.4049/jimmunol.1601850
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Adaptive immunity is driven by the expansion, somatic hypermutation, and selection of B cell clones. Each clone is the progeny of a single B cell responding to Ag, with diversified Ig receptors. These receptors can now be profiled on a large scale by nextgeneration sequencing. Such data provide a window into the microevolutionary dynamics that drive successful immune responses and the dysregulation that occurs with aging or disease. Clonal relationships are not directly measured, but they must be computationally inferred from these sequencing data. Although several hierarchical clustering-based methods have been proposed, they vary in distance and linkage methods and have not yet been rigorously compared. In this study, we use a combination of human experimental and simulated data to characterize the performance of hierarchical clustering-based methods for partitioning sequences into clones. We find that single linkage clustering has high performance, with specificity, sensitivity, and positive predictive value all > 99%, whereas other linkages result in a significant loss of sensitivity. Surprisingly, distance metrics that incorporate the biases of somatic hypermutation do not outperform simple Hamming distance. Although errors were more likely in sequences with short junctions, using the entire dataset to choose a single distance threshold for clustering is near optimal. Our results suggest that hierarchical clustering using single linkage with Hamming distance identifies clones with high confidence and provides a fully automated method for clonal grouping. The performance estimates we develop provide important context to interpret clonal analysis of repertoire sequencing data and allow for rigorous testing of other clonal grouping algorithms.
引用
收藏
页码:2489 / 2499
页数:11
相关论文
共 60 条
  • [1] Vaccination-induced changes in human B-cell repertoire and pneumococcal IgM and IgA antibody at different ages
    Ademokun, Alexander
    Wu, Yu-Chang
    Martin, Victoria
    Mitra, Rajive
    Sack, Ulrich
    Baxendale, Helen
    Kipling, David
    Dunn-Walters, Deborah K.
    [J]. AGING CELL, 2011, 10 (06) : 922 - 930
  • [2] Alamyar Eltaf, 2012, Methods Mol Biol, V882, P569, DOI 10.1007/978-1-61779-842-9_32
  • [3] [Anonymous], 1995, MONOGRAPHS STAT APPL
  • [4] [Anonymous], 2013, EL STAT TXB
  • [5] Rep-Seq: uncovering the immunological repertoire through next-generation sequencing
    Benichou, Jennifer
    Ben-Hamo, Rotem
    Louzoun, Yoram
    Efroni, Sol
    [J]. IMMUNOLOGY, 2012, 135 (03) : 183 - 191
  • [6] Rituximab and mycophenolate mofetil for relapsing proliferative lupus nephritis: a long-term prospective study
    Boletis, John N.
    Marinaki, Smaragde
    Skalioti, Chryssanthe
    Lionaki, Sofia S.
    Iniotaki, Aliki
    Sfikakis, Petros P.
    [J]. NEPHROLOGY DIALYSIS TRANSPLANTATION, 2009, 24 (07) : 2157 - 2160
  • [7] High-Throughput DNA Sequencing Analysis of Antibody Repertoires
    Boyd, Scott D.
    Joshi, Shilpa A.
    [J]. MICROBIOLOGY SPECTRUM, 2014, 2 (05):
  • [8] Measurement and Clinical Monitoring of Human Lymphocyte Clonality by Massively Parallel V-D-J Pyrosequencing
    Boyd, Scott D.
    Marshall, Eleanor L.
    Merker, Jason D.
    Maniar, Jay M.
    Zhang, Lyndon N.
    Sahaf, Bita
    Jones, Carol D.
    Simen, Birgitte B.
    Hanczaruk, Bozena
    Nguyen, Khoa D.
    Nadeau, Kari C.
    Egholm, Michael
    Miklos, David B.
    Zehnder, James L.
    Fire, Andrew Z.
    [J]. SCIENCE TRANSLATIONAL MEDICINE, 2009, 1 (12)
  • [9] Clonify: unseeded antibody lineage assignment from next-generation sequencing data
    Briney, Bryan
    Le, Khoa
    Zhu, Jiang
    Burton, Dennis R.
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [10] Chen Zhiliang, 2010, Immunome Res, V6 Suppl 1, pS4, DOI 10.1186/1745-7580-6-S1-S4