Using the antibody-antigen binding interface to train image-based deep neural networks for antibody-epitope classification

被引:18
作者
Ripoll, Daniel R. [1 ,2 ]
Chaudhury, Sidhartha [1 ,3 ]
Wallqvist, Anders [1 ]
机构
[1] US Army Med Res & Dev Command, DoD Biotechnol High Performance Comp Software App, Telemed & Adv Technol Res Ctr, Ft Detrick, MD 21702 USA
[2] Henry M Jackson Fdn Adv Mil Med Inc HJF, Bethesda, MD USA
[3] Walter Reed Army Inst Res, Ctr Enabling Capabil, Silver Spring, MD USA
关键词
EBOLA-VIRUS GP; NEUTRALIZING ANTIBODIES; MONOCLONAL-ANTIBODIES; STRUCTURAL BASIS; SURVIVOR; PREDICTION; COMPLEX; TOOL; HIV;
D O I
10.1371/journal.pcbi.1008864
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
High-throughput B-cell sequencing has opened up new avenues for investigating complex mechanisms underlying our adaptive immune response. These technological advances drive data generation and the need to mine and analyze the information contained in these large datasets, in particular the identification of therapeutic antibodies (Abs) or those associated with disease exposure and protection. Here, we describe our efforts to use artificial intelligence (AI)-based image-analyses for prospective classification of Abs based solely on sequence information. We hypothesized that Abs recognizing the same part of an antigen share a limited set of features at the binding interface, and that the binding site regions of these Abs share share common structure and physicochemical property patterns that can serve as a "fingerprint" to recognize uncharacterized Abs. We combined large-scale sequence-based protein-structure predictions to generate ensembles of 3-D Ab models, reduced the Ab binding interface to a 2-D image (fingerprint), used pre-trained convolutional neural networks to extract features, and trained deep neural networks (DNNs) to classify Abs. We evaluated this approach using Ab sequences derived from human HIV and Ebola viral infections to differentiate between two Abs, Abs belonging to specific B-cell family lineages, and Abs with different epitope preferences. In addition, we explored a different type of DNN method to detect one class of Abs from a larger pool of Abs. Testing on Ab sets that had been kept aside during model training, we achieved average prediction accuracies ranging from 71-96% depending on the complexity of the classification task. The high level of accuracies reached during these classification tests suggests that the DNN models were able to learn a series of structural patterns shared by Abs belonging to the same class. The developed methodology provides a means to apply AI-based image recognition techniques to analyze high-throughput B-cell sequencing datasets (repertoires) for Ab classification. Author summary The ability to take advantage of the rapid progress in AI for biological and medical application oftentimes requires looking at the problem from a non-traditional point-of-view. The adaptive immune system plays a key role in providing long-term immunity against pathogens. The repertoire of circulating B-cells that produce unique pathogen-specific antibodies in an individual contains immense information on both the status of the immune response at particular time and that individual's immune history. With high-throughput sequencing, we can now obtain Ab sequences for thousands of B cells from a single patient blood sample, but functionally characterizing antibodies on this scale remains on daunting task. Here, we propose to use AI to functionally classify Abs from sequence alone by re-casting this classification problem as an image recognition problem. Just as traditional image recognition involves training AI to distinguish different types of objects, we sought to use AI to distinguish different types of Ab-antigen binding interfaces. Towards that end, we generated ensembles of Ab structures from sequence, and generated 2-D 'fingerprints' of each structure that captures the essential molecular and chemical structure of the Ab binding site regions, and trained a Convolution and Deep Neural Network based AI model to classify Ab fingerprints associated with different functional characteristics. We applied this DNN-based approach to accurately predict antibody family lineage and epitope specificity against Ebola and HIV-1 viruses, and to detect sequence-diverse antibodies with similar binding properties as the ones we used for training.
引用
收藏
页数:42
相关论文
共 68 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [3] Baras A., 2018, AI MHC ALLELE INTEGR, DOI [10.1101/318881, DOI 10.1101/318881]
  • [4] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [5] Predicting peptide presentation by major histocompatibility complex class I: an improved machine learning approach to the immunopeptidome
    Boehm, Kevin Michael
    Bhinder, Bhavneet
    Raja, Vijay Joseph
    Dephoure, Noah
    Elemento, Olivier
    [J]. BMC BIOINFORMATICS, 2019, 20 (1)
  • [6] Isolation of potent neutralizing antibodies from a survivor of the 2014 Ebola virus outbreak
    Bornholdt, Zachary A.
    Turner, Hannah L.
    Murin, Charles D.
    Li, Wen
    Sok, Devin
    Souders, Colby A.
    Piper, Ashley E.
    Goff, Arthur
    Shamblin, Joshua D.
    Wollen, Suzanne E.
    Sprague, Thomas R.
    Fusco, Marnie L.
    Pommert, Kathleen B. J.
    Cavacini, Lisa A.
    Smith, Heidi L.
    Klempner, Mark
    Reimann, Keith A.
    Krauland, Eric
    Gerngross, Tillman U.
    Wittrup, Karl D.
    Saphire, Erica Ollmann
    Burton, Dennis R.
    Glass, Pamela J.
    Ward, Andrew B.
    Walker, Laura M.
    [J]. SCIENCE, 2016, 351 (6277) : 1078 - 1083
  • [7] Briggs AW., 2017, TUMOR INFILTRATING I, DOI [10.1101/134841, DOI 10.1101/134841]
  • [8] Clonify: unseeded antibody lineage assignment from next-generation sequencing data
    Briney, Bryan
    Le, Khoa
    Zhu, Jiang
    Burton, Dennis R.
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [9] Broadly Neutralizing Antibodies to HIV and Their Role in Vaccine Design
    Burton, Dennis R.
    Hangartner, Lars
    [J]. ANNUAL REVIEW OF IMMUNOLOGY, VOL 34, 2016, 34 : 635 - 659
  • [10] Chalapathy R, 2019, ARXIV180206360V2CSCV