Protein-protein interactions prediction based on ensemble deep neural networks

被引:106
作者
Zhang, Long [1 ]
Yu, Guoxian [1 ]
Xia, Dawen [2 ,3 ]
Wang, Jun [1 ]
机构
[1] Southwest Univ, Coll Comp & Informat Sci, Chongqing 400715, Peoples R China
[2] Guizhou Minzu Univ, Coll Data Sci & Informat Engn, Guiyang 550025, Guizhou, Peoples R China
[3] Guizhou Minzu Univ, Coll Natl Culture & Cognit Sci, Guiyang 550025, Guizhou, Peoples R China
关键词
Protein-protein interactions; Sequences of amino acids; Deep neural networks; Sequence descriptors; Ensemble DNNs; COMPONENT ANALYSIS; LOCAL DESCRIPTION; HYDROPHOBICITY; SEQUENCES;
D O I
10.1016/j.neucom.2018.02.097
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein interactions (PPIs) are of vital importance to most biological processes. Plenty of PPIs have been identified by wet-lab experiments in the past decades, but there are still abundant uncovered PPIs. Furthermore, wet-lab experiments are expensive and limited by the adopted experimental protocols. Although various computational models have been proposed to automatically predict PPIs and provided reliable interactions for experimental verification, the problem is still far from being solved. Novel and competent models are still anticipated. In this study, a neural network based approach called EnsDNN (Ensemble Deep Neural Networks) is proposed to predict PPIs based on different representations of amino acid sequences. Particularly, EnsDNN separately uses auto covariance descriptor, local descriptor, and multi-scale continuous and discontinuous local descriptor, to represent and explore the pattern of interactions between sequentially distant and spatially close amino acid residues. It then trains deep neural networks (DNNs) with different configurations based on each descriptor. Next, EnsDNN integrates these DNNs into an ensemble predictor to leverage complimentary information of these descriptors and of DNNs, and to predict potential PPIs. EnsDNN achieves superior performance with accuracy of 95.29%, sensitivity of 95.12%, and precision of 95.45% on predicting PPIs of Saccharomyces cerevisiae. Results on other five independent PPI datasets also demonstrate that EnsDNN gets better prediction performance than other related comparing methods. (C) 2018 Elsevier B.V. All rights reserved.
引用
收藏
页码:10 / 19
页数:10
相关论文
共 55 条
[1]  
Al Shalabi L., 2006, Journal of Computer Sciences, V2, P735, DOI 10.3844/jcssp.2006.735.739
[2]   Deep learning for computational biology [J].
Angermueller, Christof ;
Parnamaa, Tanel ;
Parts, Leopold ;
Stegle, Oliver .
MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
[3]   Algorithmic approaches to protein-protein interaction site prediction [J].
Aumentado-Armstrong, Tristan T. ;
Istrate, Bogdan ;
Murgita, Robert A. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
[4]   THE STRUCTURAL DEPENDENCE OF AMINO-ACID HYDROPHOBICITY PARAMETERS [J].
CHARTON, M ;
CHARTON, BI .
JOURNAL OF THEORETICAL BIOLOGY, 1982, 99 (04) :629-644
[5]   Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes [J].
Chou, KC .
BIOINFORMATICS, 2005, 21 (01) :10-19
[6]   Computer prediction of allergen proteins from sequence-derived protein structural and physicochemical properties [J].
Cui, Juan ;
Han, Lian Yi ;
Li, Hu ;
Ung, Choong Yong ;
Tang, Zhi Qun ;
Zheng, Chan Juan ;
Cao, Zhi Wei ;
Chen, Yu Zong .
MOLECULAR IMMUNOLOGY, 2007, 44 (04) :514-520
[7]   Optimizing amino acid groupings for GPCR classification [J].
Davies, Matthew N. ;
Secker, Andrew ;
Freitas, Alex A. ;
Clark, Edward ;
Timmis, Jon ;
Flower, Darren R. .
BIOINFORMATICS, 2008, 24 (18) :1980-1986
[8]   Predicting Hub Genes Associated with Cervical Cancer through Gene Co-Expression Networks [J].
Deng, Su-Ping ;
Zhu, Lin ;
Huang, De-Shuang .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (01) :27-35
[9]   Mining the bladder cancer-associated genes by an integrated strategy for the construction and analysis of differential co-expression networks [J].
Deng, Su-Ping ;
Zhu, Lin ;
Huang, De-Shuang .
BMC GENOMICS, 2015, 16
[10]   SFAPS: An R package for structure/function analysis of protein sequences based on informational spectrum method [J].
Deng, Su-Ping ;
Huang, De-Shuang .
METHODS, 2014, 69 (03) :207-212