Method for Essential Protein Prediction Based on a Novel Weighted Protein-Domain Interaction Network

被引:11
作者
Meng, Zixuan [1 ]
Kuang, Linai [1 ]
Chen, Zhiping [2 ]
Zhang, Zhen [2 ]
Tan, Yihong [2 ]
Li, Xueyong [2 ]
Wang, Lei [1 ,2 ]
机构
[1] Xiangtan Univ, Coll Comp, Xiangtan, Peoples R China
[2] Changsha Univ, Coll Comp Engn & Appl Math, Changsha, Peoples R China
基金
中国国家自然科学基金;
关键词
essential proteins; protein-protein interaction network; computational model; domain-domain interaction network; protein-domain interaction network; DATABASE;
D O I
10.3389/fgene.2021.645932
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
In recent years a number of calculative models based on protein-protein interaction (PPI) networks have been proposed successively. However, due to false positives, false negatives, and the incompleteness of PPI networks, there are still many challenges affecting the design of computational models with satisfactory predictive accuracy when inferring key proteins. This study proposes a prediction model called WPDINM for detecting key proteins based on a novel weighted protein-domain interaction (PDI) network. In WPDINM, a weighted PPI network is constructed first by combining the gene expression data of proteins with topological information extracted from the original PPI network. Simultaneously, a weighted domain-domain interaction (DDI) network is constructed based on the original PDI network. Next, through integrating the newly obtained weighted PPI network and weighted DDI network with the original PDI network, a weighted PDI network is further constructed. Then, based on topological features and biological information, including the subcellular localization and orthologous information of proteins, a novel PageRank-based iterative algorithm is designed and implemented on the newly constructed weighted PDI network to estimate the criticality of proteins. Finally, to assess the prediction performance of WPDINM, we compared it with 12 kinds of competitive measures. Experimental results show that WPDINM can achieve a predictive accuracy rate of 90.19, 81.96, 70.72, 62.04, 55.83, and 51.13% in the top 1%, top 5%, top 10%, top 15%, top 20%, and top 25% separately, which exceeds the prediction accuracy achieved by traditional state-of-the-art competing measures. Owing to the satisfactory identification effect, the WPDINM measure may contribute to the further development of key protein identification.
引用
收藏
页数:15
相关论文
共 38 条
[31]   DIP, the Database of Interacting Proteins:: a research tool for studying cellular networks of protein interactions [J].
Xenarios, I ;
Salwínski, L ;
Duan, XQJ ;
Higney, P ;
Kim, SM ;
Eisenberg, D .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :303-305
[32]   DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes [J].
Zhang, Ren ;
Lin, Yan .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D455-D458
[33]   Predicting Essential Proteins by Integrating Network Topology, Subcellular Localization Information, Gene Expression Profile and GO Annotation Data [J].
Zhang, Wei ;
Xu, Jia ;
Zou, Xiufen .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2020, 17 (06) :2053-2061
[34]   Detecting Essential Proteins Based on Network Topology, Gene Expression Data, and Gene Ontology Information [J].
Zhang, Wei ;
Xu, Jia ;
Li, Yuanyuan ;
Zou, Xiufen .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (01) :109-116
[35]   An ensemble framework for identifying essential proteins [J].
Zhang, Xue ;
Xiao, Wangxin ;
Acencio, Marcio Luis ;
Lemke, Ney ;
Wang, Xujing .
BMC BIOINFORMATICS, 2016, 17
[36]   A New Method for the Discovery of Essential Proteins [J].
Zhang, Xue ;
Xu, Jin ;
Xiao, Wang-xin .
PLOS ONE, 2013, 8 (03)
[37]   Prediction of Essential Proteins Based on Overlapping Essential Modules [J].
Zhao, Bihai ;
Wang, Jianxin ;
Li, Min ;
Wu, Fang-xiang ;
Pan, Yi .
IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (04) :415-424
[38]   Prediction of essential proteins based on gene expression programming [J].
Zhong, Jiancheng ;
Wang, Jianxin ;
Peng, Wei ;
Zhang, Zhen ;
Pan, Yi .
BMC GENOMICS, 2013, 14