MHC2SKpan: a novel kernel based approach for pan-specific MHC class II peptide binding prediction

被引:12
作者
Guo, Linyuan
Luo, Cheng
Zhu, Shanfeng [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
来源
BMC GENOMICS | 2013年 / 14卷
基金
中国国家自然科学基金;
关键词
AFFINITIES;
D O I
10.1186/1471-2164-14-S5-S11
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Computational methods for the prediction of Major Histocompatibility Complex (MHC) class II binding peptides play an important role in facilitating the understanding of immune recognition and the process of epitope discovery. To develop an effective computational method, we need to consider two important characteristics of the problem: (1) the length of binding peptides is highly flexible; and (2) MHC molecules are extremely polymorphic and for the vast majority of them there are no sufficient training data. Methods: We develop a novel string kernel MHC2SK (MHC-II String Kernel) method to measure the similarities among peptides with variable lengths. By considering the distinct features of MHC-II peptide binding prediction problem, MHC2SK differs significantly from the recently developed kernel based method, GS (Generic String) kernel, in the way of computing similarities. Furthermore, we extend MHC2SK to MHC2SKpan for pan-specific MHC-II peptide binding prediction by leveraging the binding data of various MHC molecules. Results: MHC2SK outperformed GS in allele specific prediction using a benchmark dataset, which demonstrates the effectiveness of MHC2SK. Furthermore, we evaluated the performance of MHC2SKpan using various benckmark data sets from several different perspectives: Leave-one-allele-out (LOO), 5-fold cross validation as well as independent data testing. MHC2SKpan has achieved comparable performance with NetMHCIIpan-2.0 and outperformed NetMHCIIpan-1.0, TEPITOPEpan and MultiRTA, being statistically significant. MHC2SKpan can be freely accessed at http://datamining-iip.fudan.edu.cn/service/MHC2SKpan/index.html.
引用
收藏
页数:9
相关论文
共 34 条
  • [1] [Anonymous], 2011, ACM T INTEL SYST TEC, DOI DOI 10.1145/1961189.1961199
  • [2] Baldi P., 2001, Bioinformatics: The Machine Learning Approach
  • [3] MultiRTA: A simple yet reliable method for predicting peptide binding affinities for multiple class II MHC allotypes
    Bordner, Andrew J.
    Mittelmann, Hans D.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [4] Prediction of the binding affinities of peptides to class II MHC using a regularized thermodynamic model
    Bordner, Andrew J.
    Mittelmann, Hans D.
    [J]. BMC BIOINFORMATICS, 2010, 11
  • [5] Prediction of promiscuous peptides that bind HLA class I molecules
    Brusic, V
    Petrovsky, N
    Zhang, GL
    Bajic, VB
    [J]. IMMUNOLOGY AND CELL BIOLOGY, 2002, 80 (03) : 280 - 285
  • [6] Learning a peptide-protein binding affinity predictor with kernel ridge regression
    Giguere, Sebastien
    Marchand, Mario
    Laviolette, Francois
    Drouin, Alexandre
    Corbeil, Jacques
    [J]. BMC BIOINFORMATICS, 2013, 14
  • [7] AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS
    HENIKOFF, S
    HENIKOFF, JG
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) : 10915 - 10919
  • [8] Ensemble approaches for improving HLA Class I-peptide binding prediction
    Hu, Xihao
    Mamitsuka, Hiroshi
    Zhu, Shanfeng
    [J]. JOURNAL OF IMMUNOLOGICAL METHODS, 2011, 374 (1-2) : 47 - 52
  • [9] MetaMHC: a meta approach to predict peptides binding to MHC molecules
    Hu, Xihao
    Zhou, Wenjian
    Udaka, Keiko
    Mamitsuka, Hiroshi
    Zhu, Shanfeng
    [J]. NUCLEIC ACIDS RESEARCH, 2010, 38 : W474 - W479
  • [10] Efficient peptideMHC-I binding prediction for alleles with few known binders
    Jacob, Laurent
    Vert, Jean-Philippe
    [J]. BIOINFORMATICS, 2008, 24 (03) : 358 - 366