ConvsPPIS: Identifying Protein-protein Interaction Sites by an Ensemble Convolutional Neural Network with Feature Graph

被引:36
作者
Zhu, Huaixu [1 ]
Du, Xiuquan [1 ]
Yao, Yu [1 ]
机构
[1] Anhui Univ, Sch Comp Sci & Technol, POB 230601, Hefei, Peoples R China
基金
美国国家科学基金会;
关键词
Feature graph; positional context; protein complex; interface prediction; convolution neural network; ensemble learning; SEQUENCE-BASED PREDICTION; WEB SERVER; CLASSIFIER; INTERFACES; RESIDUES; PROFILE;
D O I
10.2174/1574893614666191105155713
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background/ Objective: Protein-protein interactions are essentials for most cellular processes and thus, unveiling how proteins interact with is a crucial question that can be better understood by recognizing which residues participate in the interaction. Although many computational approaches have been proposed to predict interface residues, their feature perspective and model learning ability are not enough to achieve ideal results. So, our objective is to improve the predictive performance under considering feature perspective and new learning algorithm. Method: In this study, we proposed an ensemble deep convolutional neural network, which explores the context and positional context of consecutive residues within a protein sub-sequence. Specifically, unlike the feature view of previous methods, ConvsPPIS uses evolutionary, physicochemical, and structural protein characteristics to construct their own feature graph respectively. After that, three independent deep convolutional neural networks are trained on each type of feature graph for learning the underlying pattern in sub-sequence. Lastly, we integrated those three deep networks into an ensemble predictor with leveraging complementary information of those features to predict potential interface residues. Results: Some comparative experiments have conducted through 10-fold cross-validation. The results indicated that ConvsPPIS achieved superior performance on DBv5-Sel dataset with an accuracy of 88%. Additional experiments on CAPRI-Alone dataset demonstrated ConvsPPIS has also better prediction performance. Conclusion: The ConvsPPIS method provided a new perspective to capture protein feature expression for identifying protein-protein interaction sites. The results proved the superiority of this method.
引用
收藏
页码:368 / 378
页数:11
相关论文
共 43 条
[1]  
Alberts B., 2017, Molecular biology of the cell
[2]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[3]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[4]  
[Anonymous], 2015, GitHub repository
[5]   Algorithmic approaches to protein-protein interaction site prediction [J].
Aumentado-Armstrong, Tristan T. ;
Istrate, Bogdan ;
Murgita, Robert A. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
[6]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[7]   Improved prediction of protein-protein binding sites using a support vector machines approach [J].
Bradford, JR ;
Westhead, DR .
BIOINFORMATICS, 2005, 21 (08) :1487-1494
[8]   Prediction of interface residues in protein-protein complexes by a consensus neural network method: Test against NMR data [J].
Chen, HL ;
Zhou, HX .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 (01) :21-35
[9]   Sequence-based prediction of protein interaction sites with an integrative method [J].
Chen, Xue-Wen ;
Jeong, Jong Cheol .
BIOINFORMATICS, 2009, 25 (05) :585-591
[10]   How proteins get in touch: Interface prediction in the study of biomolecular complexes [J].
de Vries, Sjoerd J. ;
Bonvin, Alexandre M. J. J. .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2008, 9 (04) :394-406