Deep Neural Network and Extreme Gradient Boosting Based Hybrid Classifier for Improved Prediction of Protein-Protein Interaction

Cited by: 23
Authors
Mahapatra, Satyajit [1]
Gupta, Vivek Raj [1]
Sahu, Sitanshu Sekhar [1]
Panda, Ganapati [2]
Affiliations
[1] Birla Inst Technol, Dept ECE, Ranchi 835215, Jharkhand, India
[2] CV Raman Coll Engn, Bhubaneswar 752054, Odisha, India
Keywords
Protein-protein interaction; information fusion; hybrid classifier; deep neural network; extreme gradient boosting; INFORMATION; ALGORITHM
DOI
10.1109/TCBB.2021.3061300
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Knowledge of protein-protein interactions (PPI) is essential for understanding the behavioral processes of life and disease-causing mechanisms. In this paper, a novel hybrid approach combining a deep neural network (DNN) and an extreme gradient boosting (XGB) classifier is employed for predicting PPI. The hybrid classifier (DNN-XGB) takes as input a fusion of three sequence-based features: amino acid composition (AAC), conjoint triad composition (CT), and local descriptor (LD). The DNN extracts hidden information from the raw features through layer-wise abstraction, and this information is passed to the XGB classifier. The 5-fold cross-validation accuracies on the intraspecies interaction datasets of Saccharomyces cerevisiae (core subset), Helicobacter pylori, Saccharomyces cerevisiae, and Human are 98.35, 96.19, 97.37, and 99.74 percent, respectively. Similarly, accuracies of 98.50 and 97.25 percent are achieved on the interspecies interaction datasets of Human-Bacillus anthracis and Human-Yersinia pestis, respectively. The improved prediction accuracies obtained on the independent test sets and network datasets indicate that DNN-XGB can be used to predict cross-species interactions. It can also provide new insights into signaling pathway analysis, drug target prediction, and disease pathogenesis. The improved performance of the proposed method suggests that the hybrid classifier can serve as a useful tool for PPI prediction.
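As a rough illustration of the simplest of the three input features, the sketch below computes amino acid composition (AAC) for a sequence and joins the vectors of two proteins into one pair vector. The function names are hypothetical, and encoding a pair by plain concatenation is an assumption; the abstract does not say how the two proteins are combined. CT, LD, and the DNN and XGB stages are not shown.

```python
from collections import Counter

# The 20 standard amino acids, in a fixed order so vectors are comparable.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aac(sequence):
    """Return the fraction of each standard residue in the sequence.

    Non-standard symbols (e.g. 'X') are ignored when counting.
    """
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(AMINO_ACIDS)
    return [counts[a] / total for a in AMINO_ACIDS]


def pair_features(seq_a, seq_b):
    """Concatenate per-protein AAC vectors (assumed pair encoding)."""
    return aac(seq_a) + aac(seq_b)
```

Each protein yields a 20-dimensional vector, so a pair encoded this way has 40 dimensions before any further features are fused in.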
Pages: 155-165
Number of pages: 11