AutoPPI: An Ensemble of Deep Autoencoders for Protein-Protein Interaction Prediction

Citations: 15
Authors
Czibula, Gabriela [1]
Albu, Alexandra-Ioana [1]
Bocicor, Maria Iuliana [1]
Chira, Camelia [1]
Affiliations
[1] Univ Babes Bolyai, Dept Comp Sci, Cluj Napoca 400084, Romania
Keywords
deep learning; autoencoders; protein-protein interaction; feature representation; sequence; covariance
DOI
10.3390/e23060643
Chinese Library Classification
O4 [Physics]
Subject classification code
0702
Abstract
Proteins are essential molecules that must correctly perform their roles for the good health of living organisms. The majority of proteins operate in complexes, and the way they interact has a pivotal influence on the proper functioning of such organisms. In this study we address the problem of protein-protein interaction prediction, and we propose and investigate a method based on an ensemble of autoencoders. Our approach, entitled AutoPPI, adopts a strategy based on two autoencoders, one for each type of interaction (positive and negative), and we advance three types of neural network architectures for the autoencoders. Experiments were performed on several data sets comprising proteins from four different species. The results indicate good performance of our proposed model, with accuracy and AUC values of over 0.97 in all cases. The best-performing model relies on a Siamese architecture in both the encoder and the decoder, which advantageously captures common features in protein pairs. Comparisons with other machine learning techniques applied to the same problem show that AutoPPI outperforms most of its contenders on the considered data sets.
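The two-autoencoder strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the deep (Siamese) autoencoders are replaced here by PCA-based linear autoencoders so the example needs only NumPy, and the decision rule (assign a protein pair to the class whose autoencoder reconstructs it with lower error) is an assumption, since the abstract does not state how the ensemble's outputs are combined. The names `LinearAutoencoder`, `fit_ensemble`, and `predict` are hypothetical.

```python
import numpy as np


class LinearAutoencoder:
    """PCA-based linear autoencoder, a stand-in for a deep autoencoder."""

    def __init__(self, n_components):
        self.n_components = n_components

    def fit(self, X):
        # Encoder/decoder weights are the top principal directions of X.
        self.mean_ = X.mean(axis=0)
        _, _, vt = np.linalg.svd(X - self.mean_, full_matrices=False)
        self.components_ = vt[: self.n_components]
        return self

    def reconstruction_error(self, X):
        # Encode, decode, and return the squared error for each sample.
        centered = X - self.mean_
        codes = centered @ self.components_.T
        reconstructed = codes @ self.components_
        return np.sum((centered - reconstructed) ** 2, axis=1)


def fit_ensemble(X_pos, X_neg, n_components=2):
    """Train one autoencoder on interacting pairs and one on non-interacting pairs."""
    ae_pos = LinearAutoencoder(n_components).fit(X_pos)
    ae_neg = LinearAutoencoder(n_components).fit(X_neg)
    return ae_pos, ae_neg


def predict(ensemble, X):
    """Assumed decision rule: label 1 (interaction) if the positive-class
    autoencoder reconstructs the pair better than the negative-class one."""
    ae_pos, ae_neg = ensemble
    return (ae_pos.reconstruction_error(X) < ae_neg.reconstruction_error(X)).astype(int)
```

Each row of `X_pos` and `X_neg` would be a fixed-length feature vector describing one protein pair. How such vectors are built from sequences is not covered by the abstract and is left open here.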
Pages: 15