Allergenicity prediction by artificial neural networks

被引:9
作者
Dimitrov, Ivan [1 ]
Naneva, Lyudmila [2 ]
Bangov, Ivan [2 ]
Doytchinova, Irini [1 ]
机构
[1] Med Univ Sofia, Fac Pharm, Sofia 1000, Bulgaria
[2] Konstantin Preslavski Shumen Univ, Fac Nat Sci, Shumen 9712, Bulgaria
关键词
amino acid descriptors; ACC transformation; descriptor fingerprint; artificial neural networks; PROTEINS;
D O I
10.1002/cem.2597
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Two artificial neural network (ANN)-based algorithms for allergenicity prediction were developed and tested. The first algorithm consists of three steps. Initially, the protein sequences are described by amino acid principal properties as hydrophobicity, size, relative abundance, helix and -strand forming propensities. Second, the generated strings of different length are converted into vectors with equal length by auto-covariance and cross-covariance (ACC). At the third step, ANN is applied to discriminate between allergens and non-allergens. The second algorithm consists of four steps. It has one additional step before the final ANN modeling. At this step, the ACC vectors are transformed into binary fingerprints. The algorithms were applied to a set of 2427 known allergens and 2427 non-allergens and compared in terms of predictive ability. The three-step algorithm performed better than the four-step one identifying 82% versus 76% of the allergens and non-allergens. The ANN algorithms presented here are universal. They could be applied for any classification problem in computational biology. The amino acid descriptors are able to capture the main structural and physicochemical properties of amino acids building the proteins. The ACC transformation overcomes the main problem in the alignment-based comparative studies arising from the different length of the aligned protein sequences. The uniform-length vectors allow similarity search and classification by different computational methods. Optionally, the ACC vectors could be converted into binary descriptor fingerprints. The comparative study on several Web tools for allergenicity prediction showed that the usage of more than one predictor is reasonable and recommendable because some of the tools recognize better the allergens, some of themthe non-allergens, but none of themboth. Copyright (c) 2014 John Wiley & Sons, Ltd. Two artificial neural network (ANN)-based algorithms for allergenicity prediction were developed and tested. The first algorithm consists of three steps, and the second four steps. The algorithms were applied to a set of 2427 known allergens and 2427 non-allergens and compared in terms of predictive ability. The three-step algorithm identified 82% of the allergens and non-allergens, and the four-step algorithm 76% of them. The ANN algorithms presented here are universal. They could be applied for any classification problem in computational biology.
引用
收藏
页码:282 / 286
页数:5
相关论文
共 20 条
[1]  
[Anonymous], 1958, Elementary mathematical theory of classification and prediction
[2]   Supervised identification of allergen-representative peptides for in silico detection of potentially allergenic proteins [J].
Björklund, ÅK ;
Soeria-Atmadja, D ;
Zorzet, A ;
Hammerling, U ;
Gustafsson, MG .
BIOINFORMATICS, 2005, 21 (01) :39-50
[3]   Intestinal worms and human allergy [J].
Cooper, PJ .
PARASITE IMMUNOLOGY, 2004, 26 (11-12) :455-467
[4]   AllergenFP: allergenicity prediction by descriptor fingerprints [J].
Dimitrov, Ivan ;
Naneva, Lyudmila ;
Doytchinova, Irini ;
Bangov, Ivan .
BIOINFORMATICS, 2014, 30 (06) :846-851
[5]   AllerTOP - a server for in silico prediction of allergens [J].
Dimitrov, Ivan ;
Flower, Darren R. ;
Doytchinova, Irini .
BMC BIOINFORMATICS, 2013, 14
[6]  
FAO/WHO Codex Alimentarius Commission, 2003, COD PRINC GUID FOODS
[7]   Allermatch™, a webtool for the prediction of potential allergenicity according to current FAO/WHO Codex alimentarius guidelines -: art. no. 133 [J].
Fiers, MWEJ ;
Kleter, GA ;
Nijland, H ;
Peijnenburg, AACM ;
Nap, JP ;
van Ham, RCHJ .
BMC BIOINFORMATICS, 2004, 5 (1)
[8]   An attempt to define allergen-specific molecular surface features: a bioinformatic approach [J].
Furmonaviciene, R ;
Sutton, BJ ;
Glaser, F ;
Laughton, CA ;
Jones, N ;
Sewell, HF ;
Shakib, F .
BIOINFORMATICS, 2005, 21 (23) :4201-4204
[9]  
Hall M., 2009, SIGKDD Explorations, V11, P10, DOI DOI 10.1145/1656274.1656278
[10]   PEPTIDE QUANTITATIVE STRUCTURE-ACTIVITY-RELATIONSHIPS, A MULTIVARIATE APPROACH [J].
HELLBERG, S ;
SJOSTROM, M ;
SKAGERBERG, B ;
WOLD, S .
JOURNAL OF MEDICINAL CHEMISTRY, 1987, 30 (07) :1126-1135