Prediction of protein-protein interactions based on deep neural networks

被引:0
|
作者
Liu G.-X. [1 ,2 ]
Wang M.-Y. [1 ,2 ]
Su L.-T. [1 ,2 ]
Wu C.-G. [1 ,2 ]
Sun L.-Y. [1 ,2 ]
Wang R.-Q. [1 ,2 ]
机构
[1] College of Computer Science and Technology, Jilin University, Changchun
[2] Symbol Computation and Knowledge Engineering of Ministry Education, Jilin University, Changchun
关键词
Artificial intelligence; Deep neural network; Protein features; Protein sequence; Protein-protein interaction;
D O I
10.13229/j.cnki.jdxbgxb20171101
中图分类号
学科分类号
摘要
In order to deal with the high false-positive to false-negative rate in experimental methods, a Deep Neural Network (DNN) is constructed based on several biology features. Protein features, including GO term semantic similarity, sequence similarity, essentiality and subcellular localization information, are integrated from diverse databases to form a fixed-length eigenvector. This vector contains a great deal of related information and can be used as the input of a classifier to predict protein interactions. Then the DNN which is data driven is constructed. It is used to automatically learn information from the input data and predict whether the unknown protein pairs interact or not. Dropout is used during the training phase to prevent co-adaption and improve its performance. The method achieves a prediction accuracy of 95.67% with 96.38% precision on the S. cerevisae dataset. Experimental results show that the extracted features are suitable for the prediction of PPIs, and many commonly used machine learning models can predict interaction effectively and efficiently based on this eigenvector. Moreover the DNN has good generalization capacity and shows high performance on various feature data. © 2019, Jilin University Press. All right reserved.
引用
收藏
页码:570 / 577
页数:7
相关论文
共 23 条
  • [11] Wang B., Datasets construction and accuracy analysis in protein-protein interaction prediction based on sequence and SVM, (2013)
  • [12] Consortium U.P., UniProt: the universal protein knowledge base, Nucleic Acids Research, 45, 1, pp. 158-169, (2017)
  • [13] Shen J., Zhang J., Luo X., Et al., Predicting protein-protein interactions based only on sequences information, Proceedings of the National Academy of Sciences of the United States of America, 104, 11, pp. 4337-4341, (2007)
  • [14] Chen W.H., Pablo M., Lercher M.J., Et al., OGEE: an online gene essentiality database, Nucleic Acids Research, 40, pp. 901-906, (2012)
  • [15] Consortium T.G.O., Geneontology consortium: going forward, Nucleic Acids Research, 43, D1, pp. 1049-1056, (2015)
  • [16] Hinton G.E., Srivastava N., Krizhevsky A., Et al., Improving neural networks by preventing co-adaptation of feature detectors, Computer Science, 3, 4, pp. 212-223, (2012)
  • [17] Yang L., Xia J.F., Gui J., Prediction of protein-protein interactions from protein sequence using local descriptors, Protein & Peptide Letters, 17, 9, pp. 1085-1090, (2010)
  • [18] Shi M.G., Xia J.F., Li X.L., Et al., Predicting protein-protein interactions from sequence using correlation coefficient and high-quality interaction dataset, Amino Acids, 38, 3, pp. 891-899, (2010)
  • [19] Martin S., Roe D., Faulon J.L., Predicting protein-protein interactions using signature products, Current Opinion in Structural Biology, 15, 4, pp. 441-446, (2005)
  • [20] Bock J.R., Gough D.A., Whole-proteome interaction mining, Bioinformatics, 19, 1, pp. 125-135, (2003)