Essential Protein Detection by Random Walk on Weighted Protein-Protein Interaction Networks

被引:19
作者
Xu, Bin [1 ]
Guan, Jihong [1 ]
Wang, Yang [2 ]
Wang, Zewei [3 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 200092, Peoples R China
[2] Nanchang Univ, Dept Comp Sci & Technol, Nanchang 330031, Jiangxi, Peoples R China
[3] Shanghai Southwest Model Middle Sch, 67 Huicheng Vallige 1,Baise Rd, Shanghai 200237, Peoples R China
基金
中国国家自然科学基金;
关键词
Essential protein; weighted protein-protein interaction network; random walk; ESSENTIAL GENES; IDENTIFICATION; DATABASE; CENTRALITY; CONNECTIVITY; PREDICTION; TOOL;
D O I
10.1109/TCBB.2017.2701824
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Essential proteins are critical to the development and survival of cells. Identification of essential proteins is helpful for understanding the minimal set of required genes in a living cell and for designing new drugs. To detect essential proteins, various computational methods have been proposed based on protein-protein interaction (PPI) networks. However, protein interaction data obtained by high-throughput experiments usually contain high false positives, which negatively impacts the accuracy of essential protein detection. Moreover, most existing studies focused on the local information of proteins in PPI networks, while ignoring the influence of indirect protein interactions on essentiality. In this paper, we propose a novel method, called Essentiality Ranking (EssRank in short), to boost the accuracy of essential protein detection. To deal with the inaccuracy of PPI data, confidence scores of interactions are evaluated by integrating various biological information. Weighted edge clustering coefficient (WECC), considering both interaction confidence scores and network topology, is proposed to calculate edge weights in PPI networks. The weight of each node is evaluated by the sum of WECC values of its linking edges. A random walk method, making use of both direct and indirect protein interactions, is then employed to calculate protein essentiality iteratively. Experimental results on the yeast PPI network show that EssRank outperforms most existing methods, including the most commonly-used centrality measures (SC, DC, BC, CC, IC, and EC), topology based methods (DMNC and NC) and the data integrating method IEW.
引用
收藏
页码:377 / 387
页数:11
相关论文
共 50 条
[1]   Towards the prediction of essential genes by integration of network topology, cellular localization and biological process information [J].
Acencio, Marcio L. ;
Lemke, Ney .
BMC BIOINFORMATICS, 2009, 10 :290
[2]  
[Anonymous], 1998, TECH REP
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   Gene expression omnibus: Microarray data storage, submission, retrieval, and analysis [J].
Barrett, Tanya ;
Edgar, Ron .
DNA MICROARRAYS, PART B: DATABASES AND STATISTICS, 2006, 411 :352-369
[5]  
BONACICH P, 1987, AM J SOCIOL, V92, P1170, DOI 10.1086/228631
[6]   Concordance analysis of microbial genomes [J].
Bruccoleri, RE ;
Dougherty, TJ ;
Davison, DB .
NUCLEIC ACIDS RESEARCH, 1998, 26 (19) :4482-4486
[7]   Detecting functional modules in the yeast protein-protein interaction network [J].
Chen, Jingchun ;
Yuan, Bo .
BIOINFORMATICS, 2006, 22 (18) :2283-2290
[8]   SGD:: Saccharomyces Genome Database [J].
Cherry, JM ;
Adler, C ;
Ball, C ;
Chervitz, SA ;
Dwight, SS ;
Hester, ET ;
Jia, YK ;
Juvik, G ;
Roe, T ;
Schroeder, M ;
Weng, SA ;
Botstein, D .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :73-79
[9]  
Chua Hon Nian, 2008, Journal of Bioinformatics and Computational Biology, V6, P435, DOI 10.1142/S0219720008003497
[10]   Targeting virulence: a new paradigm for antimicrobial therapy [J].
Clatworthy, Anne E. ;
Pierson, Emily ;
Hung, Deborah T. .
NATURE CHEMICAL BIOLOGY, 2007, 3 (09) :541-548