Development of a novel clustering tool for linear peptide sequences

被引:53
作者
Dhanda, Sandeep K. [1 ]
Vaughan, Kerrie [1 ]
Schulten, Veronique [1 ]
Grifoni, Alba [1 ]
Weiskopf, Daniela [1 ]
Sidney, John [1 ]
Peters, Bjoern [1 ,2 ]
Sette, Alessandro [1 ,2 ]
机构
[1] La Jolla Inst Allergy & Immunol, Div Vaccine Discovery, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Dept Med, San Diego, CA 92103 USA
基金
美国国家卫生研究院;
关键词
Allergy; Antigens/Peptides/Epitopes; Bioinformatics >; MHC/HLA; Viral; EPITOPE; REACTIVITY; MOLECULES; ALIGNMENT; ALLERGEN; DATABASE;
D O I
10.1111/imm.12984
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
Epitopes identified in large-scale screens of overlapping peptides often share significant levels of sequence identity, complicating the analysis of epitope-related data. Clustering algorithms are often used to facilitate these analyses, but available methods are generally insufficient in their capacity to define biologically meaningful epitope clusters in the context of the immune response. To fulfil this need we developed an algorithm that generates epitope clusters based on representative or consensus sequences. This tool allows the user to cluster peptide sequences on the basis of a specified level of identity by selecting among three different method options. These include the 'clique method', in which all members of the cluster must share the same minimal level of identity with each other, and the 'connected graph method', in which all members of a cluster must share a defined level of identity with at least one other member of the cluster. In cases where it is not possible to define a clear consensus sequence with the connected graph method, a third option provides a novel 'cluster-breaking algorithm' for consensus sequence driven sub-clustering. Herein we demonstrate the tool's clustering performance and applicability using (i) a selection of dengue virus epitopes for the 'clique method', (ii) sets of allergen-derived peptides from related species for the 'connected graph method' and (iii) large data sets of eluted ligand, major histocompatibility complex binding and T-cell recognition data captured within the Immune Epitope Database (IEDB) with the newly developed 'cluster-breaking algorithm'. This novel clustering tool is accessible at http://tools.iedb.org/cluster2/.
引用
收藏
页码:331 / 345
页数:15
相关论文
共 55 条
  • [31] TumorHoPe: A Database of Tumor Homing Peptides
    Kapoor, Pallavi
    Singh, Harinder
    Gautam, Ankur
    Chaudhary, Kumardeep
    Kumar, Rahul
    Raghava, Gajendra P. S.
    [J]. PLOS ONE, 2012, 7 (04):
  • [32] Hammock: a hidden Markov model-based peptide clustering algorithm to identify protein-interaction consensus motifs in large datasets
    Krejci, Adam
    Hupp, Ted R.
    Lexa, Matej
    Vojtesek, Borivoj
    Muller, Petr
    [J]. BIOINFORMATICS, 2016, 32 (01) : 9 - 16
  • [33] Peptide Sharing Between Influenza A H1N1 Hemagglutinin and Human Axon Guidance Proteins
    Lucchese, Guglielmo
    Capone, Giovanni
    Kanduc, Darja
    [J]. SCHIZOPHRENIA BULLETIN, 2014, 40 (02) : 362 - 375
  • [34] Clique-based data mining for related genes in a biomedical database
    Matsunaga, Tsutomu
    Yonemori, Chikara
    Tomita, Etsuji
    Muramatsu, Masaaki
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [35] Deciphering complex patterns of class-I HLA-peptide cross-reactivity via hierarchical grouping
    Mukherjee, Sumanta
    Warwicker, Jim
    Chandra, Nagasuma
    [J]. IMMUNOLOGY AND CELL BIOLOGY, 2015, 93 (06) : 522 - 532
  • [36] Ou DW, 1997, J RHEUMATOL, V24, P253
  • [37] Limited diversity of peptides related to an alloreactive T cell epitope in the HLA-B27-bound peptide repertoire results from restrictions at multiple steps along the processing-loading pathway
    Paradela, A
    Alvarez, I
    García-Peydró, M
    Sesma, L
    Ramos, M
    Vázquez, J
    de Castro, JAL
    [J]. JOURNAL OF IMMUNOLOGY, 2000, 164 (01) : 329 - 337
  • [38] HIERARCHICAL CLIQUE STRUCTURES
    PEAY, ER
    [J]. SOCIOMETRY, 1974, 37 (01): : 54 - 65
  • [39] HLA Peptide Length Preferences Control CD8+ T Cell Responses
    Rist, Melissa J.
    Theodossis, Alex
    Croft, Nathan P.
    Neller, Michelle A.
    Welland, Andrew
    Chen, Zhenjun
    Sullivan, Lucy C.
    Burrows, Jacqueline M.
    Miles, John J.
    Brennan, Rebekah M.
    Gras, Stephanie
    Khanna, Rajiv
    Brooks, Andrew G.
    McCluskey, James
    Purcell, Anthony W.
    Rossjohn, Jamie
    Burrows, Scott R.
    [J]. JOURNAL OF IMMUNOLOGY, 2013, 191 (02) : 561 - 571
  • [40] Clustering of Biological Datasets in the Era of Big Data
    Roettger, Richard
    [J]. JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (01) : 300