Machine learning approaches for prediction of linear B-cell epitopes on proteins

被引:71
作者
Söllner, Johannes
Mayer, Bernd
机构
[1] Emergentec Biodev GmbH, A-1010 Vienna, Austria
[2] Intercell AG, A-1030 Vienna, Austria
[3] Univ Vienna, Inst Theoret Chem, A-1090 Vienna, Austria
关键词
machine learning; epitope; classification algorithm; peptide descriptor;
D O I
10.1002/jmr.771
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Identification and characterization of antigenic determinants on proteins has received considerable attention utilizing both, experimental as well as computational methods. For computational routines mostly structural as well as physicochemical parameters have been utilized for predicting the antigenic propensity of protein sites. However, the performance of computational routines has been low when compared to experimental alternatives. Here we describe the construction of machine learning based classifiers to enhance the prediction quality for identifying linear B-cell epitopes on proteins. Our approach combines several parameters previously associated with antigenicity, and includes novel parameters based on frequencies of amino acids and amino acid neighborhood propensities. We utilized machine learning algorithms for deriving antigenicity classification functions assigning antigenic propensities to each amino acid of a given protein sequence. We compared the prediction quality of the novel classifiers with respect to established routines for epitope scoring, and tested prediction accuracy on experimental data available for HIV proteins. The major finding is that machine learning classifiers clearly outperform the reference classification systems on the HIV epitope validation set. Copyright (c) 2006 John Wiley & Sons, Ltd.
引用
收藏
页码:200 / 208
页数:9
相关论文
共 22 条
[1]   Predictive estimation of protein linear epitopes by using the program PEOPLE [J].
Alix, AJP .
VACCINE, 1999, 18 (3-4) :311-314
[2]   Prediction of CTL epitopes using QM, SVM and ANN techniques [J].
Bhasin, M ;
Raghava, GPS .
VACCINE, 2004, 22 (23-24) :3195-3204
[3]   Benchmarking B cell epitope prediction: Underperformance of existing methods [J].
Blythe, MJ ;
Flower, DR .
PROTEIN SCIENCE, 2005, 14 (01) :246-248
[4]   Identification of in vivo expressed vaccine candidate antigens from Staphylococcus aureus [J].
Etz, H ;
Minh, DB ;
Henics, T ;
Dryla, A ;
Winkler, B ;
Triska, C ;
Boyd, AP ;
Söllner, J ;
Schmidt, W ;
von Ahsen, U ;
Buschle, M ;
Gill, SR ;
Kolonay, J ;
Khalak, H ;
Fraser, CM ;
von Gabain, A ;
Nagy, E ;
Meinke, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) :6573-6578
[5]   Defining epitopes: It's not as easy as it seems [J].
Greenspan, NS ;
Di Cera, E .
NATURE BIOTECHNOLOGY, 1999, 17 (10) :936-937
[6]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[7]   PREDICTION OF PROTEIN ANTIGENIC DETERMINANTS FROM AMINO-ACID-SEQUENCES [J].
HOPP, TP ;
WOODS, KR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1981, 78 (06) :3824-3828
[8]   PREDICTION OF CHAIN FLEXIBILITY IN PROTEINS - A TOOL FOR THE SELECTION OF PEPTIDE ANTIGENS [J].
KARPLUS, PA ;
SCHULZ, GE .
NATURWISSENSCHAFTEN, 1985, 72 (04) :212-213
[9]   A SEMIEMPIRICAL METHOD FOR PREDICTION OF ANTIGENIC DETERMINANTS ON PROTEIN ANTIGENS [J].
KOLASKAR, AS ;
TONGAONKAR, PC .
FEBS LETTERS, 1990, 276 (1-2) :172-174
[10]  
Kuiken Carla, 2003, AIDS Reviews, V5, P52