Network Representation Learning Guided by Partial Community Structure

被引:2
作者
Sun, Hanlin [1 ,2 ]
Jie, Wei [3 ]
Wang, Zhongmin [1 ,2 ]
Wang, Hai [4 ]
Ma, Sugang [1 ,2 ]
机构
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian 710121, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent P, Xian 710121, Peoples R China
[3] Univ West London, Sch Comp & Engn, London W5 5RF, England
[4] Aston Univ, Sch Engn & Appl Sci, Birmingham B4 7ET, W Midlands, England
来源
IEEE ACCESS | 2020年 / 8卷
基金
中国国家自然科学基金;
关键词
Network embedding; network representation learning; partial community structure; community structure; multi-label classification; link prediction;
D O I
10.1109/ACCESS.2020.2978517
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Network Representation Learning (NRL) is an effective way to analyze large scale networks (graphs). In general, it maps network nodes, edges, subgraphs, etc. onto independent vectors in a low dimension space, thus facilitating network analysis tasks. As community structure is one of the most prominent mesoscopic structure properties of real networks, it is necessary to preserve community structure of networks during NRL. In this paper, the concept of k-step partial community structure is defined and two Partial Community structure Guided Network Embedding (PCGNE) methods, based on two popular NRL algorithms (DeepWalk and node2vec respectively), for node representation learning are proposed. The idea behind this is that it is easier and more cost-effective to find a higher quality 1-step partial community structure than a higher quality whole community structure for networks; the extracted partial community information is then used to guide random walks in DeepWalk or node2vec. As a result, the learned node representations could preserve community structure property of networks more effectively. The two proposed algorithms and six state-of-the-art NRL algorithms were examined through multi-label classification and (inner community) link prediction on eight synthesized networks: one where community structure property could be controlled, and one real world network. The results suggested that the two PCGNE methods could improve the performance of their own based algorithm significantly and were competitive for node representation learning. Especially, comparing against used baseline algorithms, PCGNE methods could capture overlapping community structure much better, and thus could achieve better performance for multi-label classification on networks that have more overlapping nodes and/or larger overlapping memberships.
引用
收藏
页码:46665 / 46681
页数:17
相关论文
共 36 条
  • [1] [Anonymous], P 12 INT C INT SYST
  • [2] [Anonymous], ARXIV170905584
  • [3] [Anonymous], IEEE T BIG DATA
  • [4] A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications
    Cai, HongYun
    Zheng, Vincent W.
    Chang, Kevin Chen-Chuan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) : 1616 - 1637
  • [5] Cao S., 2015, P 24 ACM INT C INF K, P891, DOI DOI 10.1145/2806416.2806512
  • [6] Learning Community Embedding with Community Detection and Node Embedding on Graphs
    Cavallari, Sandro
    Zheng, Vincent W.
    Cai, Hongyun
    Chang, Kevin Chen-Chuan
    Cambria, Erik
    [J]. CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 377 - 386
  • [7] Chang C.-C., 2011, ACM T INTEL SYST TEC, V2, P27, DOI DOI 10.1145/1961189.1961199
  • [8] Chen MM, 2014, 2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), P856, DOI 10.1109/ASONAM.2014.6921686
  • [9] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [10] Graph embedding techniques, applications, and performance: A survey
    Goyal, Palash
    Ferrara, Emilio
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 78 - 94