AlgPred: prediction of allergenic proteins and mapping of IgE epitopes

Cited by: 553
Authors
Saha, Sudipto [1]
Raghava, G. P. S. [1]
Affiliations
[1] Inst Microbial Technol, Bioinformat Ctr, Sector 39A, Chandigarh, India
DOI
10.1093/nar/gkl343
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
In this study, a systematic attempt has been made to integrate various approaches in order to predict allergenic proteins with high accuracy. The dataset used for training and testing consists of 578 allergens and 700 non-allergens obtained from A. K. Bjorklund, D. Soeria-Atmadja, A. Zorzet, U. Hammerling and M. G. Gustafsson (2005) Bioinformatics, 21, 39-50. First, we developed methods based on support vector machines using amino acid and dipeptide composition, which achieved accuracies of 85.02% and 84.00%, respectively. Second, a motif-based method was developed using the MEME/MAST software; it achieved a sensitivity of 93.94% with a specificity of 33.34%. Third, a database of known IgE epitopes was searched, which predicted allergenic proteins with 17.47% sensitivity at a specificity of 98.14%. Fourth, we predicted allergenic proteins by performing a BLAST search against allergen-representative peptides. Finally, hybrid approaches have been developed that combine two or more of these approaches. The performance of all these algorithms has been evaluated on an independent dataset of 323 allergens and on 101 725 non-allergens obtained from Swiss-Prot. A web server, AlgPred, has been developed for predicting allergenic proteins and for mapping IgE epitopes on allergenic proteins (http://www.imtech.res.in/raghava/algpred/).
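The abstract names amino acid composition and dipeptide composition as the input features for the support vector machine models but does not spell out how they are encoded. The following is a minimal sketch of how such features are commonly computed, not the authors' implementation. The function names, the percentage scaling, and the decision to drop non-standard residues before counting are all assumptions made for illustration.

```python
from itertools import product

# The 20 standard amino acids, in a fixed order so feature positions are stable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered residue pairs: AA, AC, AD, ..., YY.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def _clean(sequence):
    """Upper-case the sequence and drop non-standard residues (an assumption)."""
    return "".join(aa for aa in sequence.upper() if aa in AMINO_ACIDS)


def amino_acid_composition(sequence):
    """Return a 20-element vector: percentage of each residue in the sequence."""
    seq = _clean(sequence)
    if not seq:
        return [0.0] * len(AMINO_ACIDS)
    return [100.0 * seq.count(aa) / len(seq) for aa in AMINO_ACIDS]


def dipeptide_composition(sequence):
    """Return a 400-element vector: percentage of each overlapping residue pair."""
    seq = _clean(sequence)
    total = len(seq) - 1  # number of overlapping pairs in the sequence
    if total <= 0:
        return [0.0] * len(DIPEPTIDES)
    counts = dict.fromkeys(DIPEPTIDES, 0)
    for i in range(total):
        counts[seq[i:i + 2]] += 1
    return [100.0 * counts[dp] / total for dp in DIPEPTIDES]
```

Each protein then becomes a fixed-length vector (20 or 400 values) regardless of its sequence length, which is what allows a standard classifier such as an SVM to be trained on proteins of varying size. Note that dropping non-standard residues makes the residues on either side of a dropped position count as a pair, a simplification of this sketch.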
Pages: W202-W209
Page count: 8
Related papers
44 in total
[1]  
Aalberse RC, 2000, J ALLERGY CLIN IMMUN, V106, P228, DOI 10.1067/mai.2000.108434
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[4]   Combining evidence using p-values: application to sequence homology searches [J].
Bailey, TL ;
Gribskov, M .
BIOINFORMATICS, 1998, 14 (01) :48-54
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   Feature-based prediction of non-classical and leaderless protein secretion [J].
Bendtsen, JD ;
Jensen, LJ ;
Blom, N ;
von Heijne, G ;
Brunak, S .
PROTEIN ENGINEERING DESIGN & SELECTION, 2004, 17 (04) :349-356
[7]   MHCBN: a comprehensive database of MHC binding and non-binding peptides [J].
Bhasin, M ;
Singh, H ;
Raghava, GPS .
BIOINFORMATICS, 2003, 19 (05) :665-666
[8]   Classification of nuclear receptors based on amino acid composition and dipeptide composition [J].
Bhasin, M ;
Raghava, GPS .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2004, 279 (22) :23262-23266
[9]   Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins [J].
Björklund, ÅK ;
Soeria-Atmadja, D ;
Zorzet, A ;
Hammerling, U ;
Gustafsson, MG .
BIOINFORMATICS, 2005, 21 (01) :39-50
[10]   Benchmarking B cell epitope prediction: Underperformance of existing methods [J].
Blythe, MJ ;
Flower, DR .
PROTEIN SCIENCE, 2005, 14 (01) :246-248