Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method

被引:413
作者
Nielsen, Morten [1 ]
Lundegaard, Claus [1 ]
Lund, Ole [1 ]
机构
[1] Tech Univ Denmark, Ctr Biol Sequence Anal, Bioctr, DK-2800 Lyngby, Denmark
关键词
D O I
10.1186/1471-2105-8-238
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Antigen presenting cells (APCs) sample the extra cellular space and present peptides from here to T helper cells, which can be activated if the peptides are of foreign origin. The peptides are presented on the surface of the cells in complex with major histocompatibility class II (MHC II) molecules. Identification of peptides that bind MHC II molecules is thus a key step in rational vaccine design and developing methods for accurate prediction of the peptide: MHC interactions play a central role in epitope discovery. The MHC class II binding groove is open at both ends making the correct alignment of a peptide in the binding groove a crucial part of identifying the core of an MHC class II binding motif. Here, we present a novel stabilization matrix alignment method, SMM-align, that allows for direct prediction of peptide: MHC binding affinities. The predictive performance of the method is validated on a large MHC class II benchmark data set covering 14 HLA-DR (human MHC) and three mouse H2-1A alleles. Results: The predictive performance of the SMM-align method was demonstrated to be superior to that of the Gibbs sampler, TEPITOPE, SVRMHC, and MHCpred methods. Cross validation between peptide data set obtained from different sources demonstrated that direct incorporation of peptide length potentially results in over-fitting of the binding prediction method. Focusing on amino terminal peptide flanking residues (PFR), we demonstrate a consistent gain in predictive performance by favoring binding registers with a minimum PFR length of two amino acids. Visualizing the binding motif as obtained by the SMM-align and TEPITOPE methods highlights a series of fundamental discrepancies between the two predicted motifs. For the DRB1*1302 allele for instance, the TEPITOPE method favors basic amino acids at most anchor positions, whereas the SMM-align method identifies a preference for hydrophobic or neutral amino acids at the anchors. Conclusion: The SMM-align method was shown to outperform other state of the art MHC class II prediction methods. The method predicts quantitative peptide: MHC binding affinity values, making it ideally suited for rational epitope discovery. The method has been trained and evaluated on the, to our knowledge, largest benchmark data set publicly available and covers the nine HLADR supertypes suggested as well as three mouse H2-IA allele. Both the peptide benchmark data set, and SMM-align prediction method (NetMHCII) are made publicly available.
引用
收藏
页数:12
相关论文
共 27 条
  • [21] Sette Alessandro, 2005, Immunity, V22, P155, DOI 10.1016/j.immuni.2005.01.009
  • [22] ProPred: prediction of HLA-DR binding sites
    Singh, H
    Raghava, GPS
    [J]. BIOINFORMATICS, 2001, 17 (12) : 1236 - 1237
  • [23] Generation of tissue-specific and promiscuous HLA ligand databases using DNA microarrays and virtual HLA class II matrices
    Sturniolo, T
    Bono, E
    Ding, JY
    Raddrizzani, L
    Tuereci, O
    Sahin, U
    Braxenthaler, M
    Gallazzi, F
    Protti, MP
    Sinigaglia, F
    Hammer, J
    [J]. NATURE BIOTECHNOLOGY, 1999, 17 (06) : 555 - 561
  • [24] MEASURING THE ACCURACY OF DIAGNOSTIC SYSTEMS
    SWETS, JA
    [J]. SCIENCE, 1988, 240 (4857) : 1285 - 1293
  • [25] Toseland Christopher P, 2005, Immunome Res, V1, P4, DOI 10.1186/1745-7580-1-4
  • [26] SVRMHC prediction server for MHC-binding peptides
    Wan, Ji
    Liu, Wen
    Xu, Qiqi
    Ren, Yongliang
    Flower, Darren R.
    Li, Tongbin
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [27] PREDBALB/c:: a system for the prediction of peptide binding to H2d molecules, a haplotype of the BALB/c mouse
    Zhang, GL
    Srinivasan, KN
    Veeramani, A
    August, JT
    Brusic, V
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 : W180 - W183