Centrality Combination Method Based on Feature Selection for Protein Interaction Networks

Cited by: 1
Authors
Wang, Haoyue [1]
Pan, Li [1]
Sun, Jing [1]
Li, Bin [1]
Jiang, Junqiang [1]
Yang, Bo [1]
Li, Wenbin [1]
Affiliation
[1] Hunan Inst Sci & Technol, Dept Informat Sci & Engn, Yueyang 414006, Peoples R China
Keywords
Proteins; Gene expression; Feature extraction; Correlation; Sensitivity; Redundancy; Topology; Interactive systems; Centrality methods; combination method; essential proteins; feature selection; protein interaction networks; IDENTIFICATION; DATABASE
DOI
10.1109/ACCESS.2022.3216416
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Essential proteins are important participants in various life activities and play a vital role in the survival and reproduction of life. Network-based centrality methods are a common way to identify essential proteins in protein interaction networks. Because the existing centrality methods differ from one another, combining them is a feasible approach to improving the identification accuracy of essential proteins. In this paper, we propose a centrality combination method based on feature selection. First, the measure values of 14 classical centrality methods are viewed as feature data. Then, a subset of the relevant features is selected according to the importance of the features. Finally, the centrality methods corresponding to the selected features are combined by using the geometric mean method for the identification of essential proteins. To verify the effectiveness of the combination method, we apply it to the original static protein interaction network (SPIN), the dynamic protein interaction network (DPIN) and the refined dynamic protein interaction network (RDPIN), and compare the results with those of each single centrality method (LAC, DC, DMNC, NC, TP, CLC, BC, LC, CC, KC, CR, EC, PR, LR). The experimental results on the identification of essential proteins show that the combination method achieves better prediction performance than the 14 centrality methods in terms of prediction precision, sensitivity, specificity, positive predictive value, negative predictive value, F-measure and accuracy rate. These results indicate that the proposed method can help to identify essential proteins more accurately.
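The final combination step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names `combine_geometric_mean` and `top_k`, the toy scores, and the choice of selected methods are all hypothetical, and the feature-selection step is assumed to have already produced the list of selected centrality methods. The sketch also assumes non-negative scores and makes no attempt to reproduce any normalisation the paper may apply.

```python
import math


def combine_geometric_mean(scores, selected):
    """Combine selected centrality scores per protein by geometric mean.

    scores:   {method_name: {protein_id: non-negative score}}  (hypothetical layout)
    selected: method names assumed to come from a prior feature-selection step
    """
    if not selected:
        raise ValueError("at least one centrality method must be selected")
    # Only proteins scored by every selected method can be combined.
    proteins = set.intersection(*(set(scores[m]) for m in selected))
    n = len(selected)
    return {
        p: math.prod(scores[m][p] for m in selected) ** (1.0 / n)
        for p in proteins
    }


def top_k(combined, k):
    """Return the k highest-scoring proteins (ties broken by protein id)."""
    return sorted(combined, key=lambda p: (-combined[p], p))[:k]


# Toy data only; these values are invented for illustration.
toy_scores = {
    "DC": {"P1": 4.0, "P2": 1.0, "P3": 1.0},
    "NC": {"P1": 9.0, "P2": 4.0, "P3": 1.0},
    "BC": {"P1": 0.5, "P2": 0.9, "P3": 0.1},
}
selected = ["DC", "NC"]  # assumed output of the feature-selection step

combined = combine_geometric_mean(toy_scores, selected)
ranking = top_k(combined, 2)
print(ranking)
```

In this toy example the candidate essential proteins are the top-ranked entries of `ranking`; a protein that scores zero under any selected method receives a combined score of zero, which is a property of the geometric mean itself.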
Pages: 112028-112042
Page count: 15