The Network Representation Learning Algorithm Based on Semi-Supervised Random Walk

被引:2
作者
Liu, Dong [1 ,2 ,3 ]
Li, Qinpeng [1 ,2 ]
Ru, Yan [1 ,2 ]
Zhang, Jun [4 ]
机构
[1] Henan Normal Univ, Sch Comp & Informat Engn, Xinxiang 453007, Henan, Peoples R China
[2] Henan Normal Univ, Big Data Engn Lab Teaching Resources & Assessment, Xinxiang 453007, Henan, Peoples R China
[3] Key Lab Artificial Intelligence & Personalized Le, Xinxiang 453007, Henan, Peoples R China
[4] Zhengzhou Univ, Sch Mech Engn, Zhengzhou 450001, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Clustering algorithms; Matrix decomposition; Licenses; Complex networks; Topology; Symmetric matrices; Network representation learning; semi-supervised; pairwise constraints; community structure; random walk;
D O I
10.1109/ACCESS.2020.3044367
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
As an important tool of social network analysis, network representation learning also called network embedding maps the network to a latent space and learns low-dimensional and dense real vectors of nodes, while preserving the structure and internal attributes of network. The learned representations or embedding vectors can be used for node clustering, link prediction, network visualization and other tasks for network analysis. Most of the existing network representation learning algorithms mainly focus on the preservation of micro or macro network structure, ignoring the mesoscopic community structure information. Although a few network embedding methods are proposed to preserve the community structure, they all ignore the prior information about communities. Inspired by the semi-supervised community detection in complex networks, in this article, a novel Semi-Supervised DeepWalk method(SSDW) is proposed for network representation learning, which successfully preserves the community structure of network in the embedding space. Specifically, a semi-supervised random walk sampling method which effectively integrates the pairwise constraints is proposed. By doing so, the SSDW model can guide the transition probability in the random walk process and obtain the node context sequence in line with the prior knowledge. The experimental results on eight real networks show that comparing with the popular network embedding methods, the node representation vectors integrating pairwise constraints into the random walk process can obtain higher accuracy on node clustering task, and the results of link prediction, network visualization tasks indicate that the semi-supervised model SSDW is more discriminative than unsupervised ones.
引用
收藏
页码:222956 / 222965
页数:10
相关论文
共 31 条
  • [1] Graph Regularized Nonnegative Matrix Factorization for Data Representation
    Cai, Deng
    He, Xiaofei
    Han, Jiawei
    Huang, Thomas S.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1548 - 1560
  • [2] Cao S., 2015, P 24 ACM INT C INF K, P891, DOI DOI 10.1145/2806416.2806512
  • [3] A Survey on Network Embedding
    Cui, Peng
    Wang, Xiao
    Pei, Jian
    Zhu, Wenwu
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (05) : 833 - 852
  • [4] Semi-Supervised Community Detection Based on Distance Dynamics
    Fan, Lilin
    Xu, Shengli
    Liu, Dong
    Ru, Yan
    [J]. IEEE ACCESS, 2018, 6 : 37261 - 37271
  • [5] BiNE: Bipartite Network Embedding
    Gao, Ming
    Chen, Leihui
    He, Xiangnan
    Zhou, Aoying
    [J]. ACM/SIGIR PROCEEDINGS 2018, 2018, : 715 - 724
  • [6] Community structure in social and biological networks
    Girvan, M
    Newman, MEJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (12) : 7821 - 7826
  • [7] Graph embedding techniques, applications, and performance: A survey
    Goyal, Palash
    Ferrara, Emilio
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 151 : 78 - 94
  • [8] node2vec: Scalable Feature Learning for Networks
    Grover, Aditya
    Leskovec, Jure
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 855 - 864
  • [9] Community aware random walk for network embedding
    Keikha, Mohammad Mehdi
    Rahgozar, Maseud
    Asadpour, Masoud
    [J]. KNOWLEDGE-BASED SYSTEMS, 2018, 148 : 47 - 54
  • [10] Kipf TN, 2016, ARXIV, DOI DOI 10.48550/ARXIV.1609.02907