AutoPPI: An Ensemble of Deep Autoencoders for Protein-Protein Interaction Prediction

被引:15
作者
Czibula, Gabriela [1 ]
Albu, Alexandra-Ioana [1 ]
Bocicor, Maria Iuliana [1 ]
Chira, Camelia [1 ]
机构
[1] Univ Babes Bolyai, Dept Comp Sci, Cluj Napoca 400084, Romania
关键词
deep learning; autoencoders; protein-protein interaction; FEATURE REPRESENTATION; SEQUENCE; COVARIANCE;
D O I
10.3390/e23060643
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Proteins are essential molecules, that must correctly perform their roles for the good health of living organisms. The majority of proteins operate in complexes and the way they interact has pivotal influence on the proper functioning of such organisms. In this study we address the problem of protein-protein interaction and we propose and investigate a method based on the use of an ensemble of autoencoders. Our approach, entitled AutoPPI, adopts a strategy based on two autoencoders, one for each type of interactions (positive and negative) and we advance three types of neural network architectures for the autoencoders. Experiments were performed on several data sets comprising proteins from four different species. The results indicate good performances of our proposed model, with accuracy and AUC values of over 0.97 in all cases. The best performing model relies on a Siamese architecture in both the encoder and the decoder, which advantageously captures common features in protein pairs. Comparisons with other machine learning techniques applied for the same problem prove that AutoPPI outperforms most of its contenders, for the considered data sets.
引用
收藏
页数:15
相关论文
共 47 条
[11]   iFeature: a Python']Python package and web server for features extraction and selection from protein and peptide sequences [J].
Chen, Zhen ;
Zhao, Pei ;
Li, Fuyi ;
Leier, Andre ;
Marquez-Lago, Tatiana T. ;
Wang, Yanan ;
Webb, Geoffrey I. ;
Smith, A. Ian ;
Daly, Roger J. ;
Chou, Kuo-Chen ;
Song, Jiangning .
BIOINFORMATICS, 2018, 34 (14) :2499-2502
[12]   Evaluation Measures of the Classification Performance of Imbalanced Data Sets [J].
Gu, Qiong ;
Zhu, Li ;
Cai, Zhihua .
COMPUTATIONAL INTELLIGENCE AND INTELLIGENT SYSTEMS, 2009, 51 :461-+
[13]   Using Deep Neural Networks to Improve the Performance of Protein-Protein Interactions Prediction [J].
Gui, Yuan-Miao ;
Wang, Ru-Jing ;
Wang, Xue ;
Wei, Yuan-Yuan .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (13)
[14]   DNN-PPI: A LARGE-SCALE PREDICTION OF PROTEIN-PROTEIN INTERACTIONS BASED ON DEEP NEURAL NETWORKS [J].
Gui, Yuanmiao ;
Wang, Rujing ;
Wei, Yuanyuan ;
Wang, Xue .
JOURNAL OF BIOLOGICAL SYSTEMS, 2019, 27 (01) :1-18
[15]   Using support vector machine combined with auto covariance to predict proteinprotein interactions from protein sequences [J].
Guo, Yanzhi ;
Yu, Lezheng ;
Wen, Zhining ;
Li, Menglong .
NUCLEIC ACIDS RESEARCH, 2008, 36 (09) :3025-3030
[16]   PRED-PPI: A server for predicting protein-protein interactions based on sequence data with probability assignment [J].
Guo Y. ;
Li M. ;
Pu X. ;
Li G. ;
Guang X. ;
Xiong W. ;
Li J. .
BMC Research Notes, 3 (1)
[17]   Predicting protein-protein interactions through sequence-based deep learning [J].
Hashemifar, Somaye ;
Neyshabur, Behnam ;
Khan, Aly A. ;
Xu, Jinbo .
BIOINFORMATICS, 2018, 34 (17) :802-810
[18]   A Bayesian networks approach for predicting protein-protein interactions from genomic data [J].
Jansen, R ;
Yu, HY ;
Greenbaum, D ;
Kluger, Y ;
Krogan, NJ ;
Chung, SB ;
Emili, A ;
Snyder, M ;
Greenblatt, JF ;
Gerstein, M .
SCIENCE, 2003, 302 (5644) :449-453
[19]  
Klambauer G., ARXIV 2017 170602515
[20]  
Koch G., 2015, P DEEP LEARN WORKSH