A community resource benchmarking predictions of peptide binding to MHC-I molecules

被引:228
作者
Peters, Bjoern [1 ]
Bui, Huynh-Hoa
Frankild, Sune
Nielsen, Morten
Lundegaard, Claus
Kostem, Emrah
Basch, Derek
Lamberth, Kasper
Harndahl, Mikkel
Fleri, Ward
Wilson, Stephen S.
Sidney, John
Lund, Ole
Buus, Soren
Sette, Alessandro
机构
[1] La Jolla Inst Allergy & Immunol, San Diego, CA USA
[2] Tech Univ Denmark, Bioctr, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[3] Univ Copenhagen, Inst Med Microbiol & Immunol, Dept Expt Immunol, Copenhagen, Denmark
关键词
D O I
10.1371/journal.pcbi.0020065
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Recognition of peptides bound to major histocompatibility complex (MHC) class I molecules by T lymphocytes is an essential part of immune surveillance. Each MHC allele has a characteristic peptide binding preference, which can be captured in prediction algorithms, allowing for the rapid scan of entire pathogen proteomes for peptide likely to bind MHC. Here we make public a large set of 48,828 quantitative peptide-binding affinity measurements relating to 48 different mouse, human, macaque, and chimpanzee MHC class I alleles. We use this data to establish a set of benchmark predictions with one neural network method and two matrix-based prediction methods extensively utilized in our groups. In general, the neural network outperforms the matrix-based predictions mainly due to its ability to generalize even on a small amount of data. We also retrieved predictions from tools publicly available on the internet. While differences in the data used to generate these predictions hamper direct comparisons, we do conclude that tools based on combinatorial peptide libraries perform remarkably well. The transparent prediction evaluation on this dataset provides tool developers with a benchmark for comparison of newly developed prediction methods. In addition, to generate and evaluate our own prediction methods, we have established an easily extensible web-based prediction framework that allows automated side-by-side comparisons of prediction methods implemented by experts. This is an advance over the current practice of tool developers having to generate reference predictions themselves, which can lead to underestimating the performance of prediction methods they are not as familiar with as their own. The overall goal of this effort is to provide a transparent prediction evaluation allowing bioinformaticians to identify promising features of prediction methods and providing guidance to immunologists regarding the reliability of prediction tools.
引用
收藏
页码:574 / 584
页数:11
相关论文
共 50 条
  • [1] Analysis and prediction of affinity of TAP binding peptides using cascade SVM
    Bhasin, M
    Raghava, GPS
    [J]. PROTEIN SCIENCE, 2004, 13 (03) : 596 - 607
  • [2] MHCBN: a comprehensive database of MHC binding and non-binding peptides
    Bhasin, M
    Singh, H
    Raghava, GPS
    [J]. BIOINFORMATICS, 2003, 19 (05) : 665 - 666
  • [3] SVM based method for predicting HLA-DRB1*0401 binding peptides in an antigen sequence
    Bhasin, M
    Raghava, GPS
    [J]. BIOINFORMATICS, 2004, 20 (03) : 421 - 423
  • [4] Brusic V, 1999, In Silico Biol, V1, P109
  • [5] Automated generation and evaluation of specific MHC binding predictive tools:: ARB matrix applications
    Bui, HH
    Sidney, J
    Peters, B
    Sathiamurthy, M
    Sinichi, A
    Purton, KA
    Mothé, BR
    Chisari, FV
    Watkins, DI
    Sette, A
    [J]. IMMUNOGENETICS, 2005, 57 (05) : 304 - 314
  • [6] Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach
    Buus, S
    Lauemoller, SL
    Worning, P
    Kesmir, C
    Frimurer, T
    Corbet, S
    Fomsgaard, A
    Hilden, J
    Holm, A
    Brunak, S
    [J]. TISSUE ANTIGENS, 2003, 62 (05): : 378 - 384
  • [7] Descamps FJ, 2003, ADV EXP MED BIOL, V535, P69
  • [8] Prediction of MHC class I binding peptides, using SVMHC -: art. no. 25
    Dönnes, P
    Elofsson, A
    [J]. BMC BIOINFORMATICS, 2002, 3 (1)
  • [9] GOLDSTEIN A, 1987, MOL PHARMACOL, V31, P603
  • [10] Guan Pingping, 2003, Appl Bioinformatics, V2, P63