Machine-learning techniques for the prediction of protein–protein interactions

被引:0
作者
Debasree Sarkar
Sudipto Saha
机构
[1] SUNY Upstate Medical University,Division of Bioinformatics
[2] Bose Institute,undefined
来源
Journal of Biosciences | 2019年 / 44卷
关键词
Clustering; deep learning; decision tree; machine-learning techniques; protein–protein interaction; support vector machine;
D O I
暂无
中图分类号
学科分类号
摘要
Protein–protein interactions (PPIs) are important for the study of protein functions and pathways involved in different biological processes, as well as for understanding the cause and progression of diseases. Several high-throughput experimental techniques have been employed for the identification of PPIs in a few model organisms, but still, there is a huge gap in identifying all possible binary PPIs in an organism. Therefore, PPI prediction using machine-learning algorithms has been used in conjunction with experimental methods for discovery of novel protein interactions. The two most popular supervised machine-learning techniques used in the prediction of PPIs are support vector machines and random forest classifiers. Bayesian-probabilistic inference has also been used but mainly for the scoring of high-throughput PPI dataset confidence measures. Recently, deep-learning algorithms have been used for sequence-based prediction of PPIs. Several clustering methods such as hierarchical and k-means are useful as unsupervised machine-learning algorithms for the prediction of interacting protein pairs without explicit data labelling. In summary, machine-learning techniques have been widely used for the prediction of PPIs thus allowing experimental researchers to study cellular PPI networks.
引用
收藏
相关论文
共 280 条
[1]  
Alonso-López D(2016)APID interactomes: Providing proteome-based interactomes with controlled quality for multiple species and derived networks Nucleic Acids Res. 44 W529-W535
[2]  
An JY(2016)RVMAB: Using the relevance vector machine model combined with average blocks to predict the interactions of proteins from protein sequences Int. J. Mol. Sci. 17 757-770
[3]  
You ZH(2003)An automated method for finding molecular complexes in large protein interaction networks BMC Bioinf. 4 2-68
[4]  
Meng FR(2017)A new feature vector based on gene ontology terms for protein–protein interaction prediction. ACM 14 762-i46
[5]  
Xu SJ(2011)Network medicine: A network-based approach to human disease Nat. Rev. Genet. 12 56-460
[6]  
Wang Y(2014)Prediction of interactions between viral and host proteins using supervised machine learning methods PLoS One 9 e112034-1494
[7]  
Bader GD(2015)Prediction of intra-species protein–protein interactions in enteropathogens facilitating systems biology study PLoS One 10 e0145648-15
[8]  
Hogue CW(2005)Kernel methods for predicting protein–protein interactions Bioinformatics 21 i38-32007
[9]  
Bandyopadhyay S(2010)Class prediction for high-dimensional class-imbalanced data BMC Bioinf. 11 523-D379
[10]  
Mallick K(2001)Predicting protein–protein interactions from primary structure Bioinformatics 17 455-278