Deep Neural Network and Extreme Gradient Boosting Based Hybrid Classifier for Improved Prediction of Protein-Protein Interaction

被引:23
作者
Mahapatra, Satyajit [1 ]
Gupta, Vivek Raj [1 ]
Sahu, Sitanshu Sekhar [1 ]
Panda, Ganapati [2 ]
机构
[1] Birla Inst Technol, Dept ECE, Ranchi 835215, Jharkhand, India
[2] CV Raman Coll Engn, Bhubaneswar 752054, Odisha, India
关键词
Protein-protein interaction; information fusion; hybrid classifier; deep neural network; extreme gradient boosting; INFORMATION; ALGORITHM;
D O I
10.1109/TCBB.2021.3061300
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the behavioral process of life and disease-causing mechanism, knowledge regarding protein-protein interactions (PPI) is essential. In this paper, a novel hybrid approach combining deep neural network (DNN) and extreme gradient boosting classifier (XGB) is employed for predicting PPI. The hybrid classifier (DNN-XGB) uses a fusion of three sequence-based features, amino acid composition (AAC), conjoint triad composition (CT), and local descriptor (LD) as inputs. The DNN extracts the hidden information through a layer-wise abstraction from the raw features that are passed through the XGB classifier. The 5-fold cross-validation accuracy for intraspecies interactions dataset of Saccharomyces cerevisiae (core subset), Helicobacter pylori, Saccharomyces cerevisiae, and Human are 98.35, 96.19, 97.37, and 99.74 percent respectively. Similarly, accuracies of 98.50 and 97.25 percent are achieved for interspecies interaction dataset of Human- Bacillus Anthracis and Human- Yersinia pestis datasets, respectively. The improved prediction accuracies obtained on the independent test sets and network datasets indicate that the DNN-XGB can be used to predict cross-species interactions. It can also provide new insights into signaling pathway analysis, predicting drug targets, and understanding disease pathogenesis. Improved performance of the proposed method suggests that the hybrid classifier can be used as a useful tool for PPI prediction.
引用
收藏
页码:155 / 165
页数:11
相关论文
共 50 条
[11]   Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method [J].
Goktepe, Yunus Emre ;
Kodaz, Halife .
NEUROCOMPUTING, 2018, 303 :68-74
[12]  
Gonzalez-Lopez F, 2018, IEEE INT C BIOINFORM, P2344, DOI 10.1109/BIBM.2018.8621328
[13]   Using support vector machine combined with auto covariance to predict proteinprotein interactions from protein sequences [J].
Guo, Yanzhi ;
Yu, Lezheng ;
Wen, Zhining ;
Li, Menglong .
NUCLEIC ACIDS RESEARCH, 2008, 36 (09) :3025-3030
[14]   Predicting protein-protein interactions through sequence-based deep learning [J].
Hashemifar, Somaye ;
Neyshabur, Behnam ;
Khan, Aly A. ;
Xu, Jinbo .
BIOINFORMATICS, 2018, 34 (17) :802-810
[15]   Construction of reliable protein-protein interaction networks using weighted sparse representation based classifier with pseudo substitution matrix representation features [J].
Huang, Yu-An ;
You, Zhu-Hong ;
Li, Xiao ;
Chen, Xing ;
Hu, Pengwei ;
Li, Shuai ;
Luo, Xin .
NEUROCOMPUTING, 2016, 218 :131-138
[16]   Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding [J].
Huang, Yu-An ;
You, Zhu-Hong ;
Chen, Xing ;
Chan, Keith ;
Luo, Xin .
BMC BIOINFORMATICS, 2016, 17
[17]   Deep Convolutional and Recurrent Neural Networks for Cell Motility Discrimination and Prediction [J].
Kimmel, Jacob C. ;
Brack, Andrew S. ;
Marshall, Wallace F. .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (02) :562-574
[18]   A new sequence based encoding for prediction of host-pathogen protein interactions [J].
Kosesoy, Irfan ;
Gok, Murat ;
Oz, Cemil .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 78 :170-177
[19]   Using Weighted Extreme Learning Machine Combined With Scale-Invariant Feature Transform to Predict Protein-Protein Interactions From Protein Evolutionary Information [J].
Li, Jianqiang ;
Shi, Xiaofeng ;
You, Zhu-Hong ;
Yi, Hai-Cheng ;
Chen, Zhuangzhuang ;
Lin, Qiuzhen ;
Fang, Min .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (05) :1546-1554
[20]   Deep Neural Networks for In Situ Hybridization Grid Completion and Clustering [J].
Li, Yujie ;
Huang, Heng ;
Chen, Hanbo ;
Liu, Tianming .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (02) :536-546