An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks

Cited by: 0
Authors
Xu, Xin [1]
Lu, Yang [1]
Zhou, Yupeng [1]
Fu, Zhiguo [1]
Fu, Yanjie [2]
Yin, Minghao [1]
Affiliations
[1] Northeast Normal Univ, Coll Informat Sci & Technol, Dept Comp Sci, Changchun 130117, Peoples R China
[2] Univ Cent Florida, Coll Engn & Comp Sci, Dept Comp Sci, Orlando, FL 32816 USA
Keywords
network representation learning; random walk; stationary distributions; unsupervised learning; network embedding; PREDICTION
DOI
10.3390/math9151767
Chinese Library Classification number
O1 [Mathematics]
Discipline classification code
0701; 070101
Abstract
Network representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Because obtaining label information for nodes is expensive, many unsupervised network representation learning methods have been proposed, among which the random walk strategy is one of the most widely used approaches. However, existing random walk based methods face several challenges: (1) they do not sufficiently explain what network knowledge is captured in the sampled walking paths; (2) mixing different kinds of information in a network has adverse effects; (3) methods with hyper-parameters generalize poorly across different networks. This paper proposes an information-explainable random walk based unsupervised network representation learning framework named Probabilistic Accepted Walk (PAW), which obtains network representations from the perspective of the stationary distribution of networks. In the framework, we design two stationary distributions, based on nodes' self-information and on the local information of networks, to guide our proposed random walk strategy, which learns representational vectors of networks by sampling paths of nodes. Experimental results demonstrate that PAW obtains more expressive representations than six widely used unsupervised network representation learning baselines on four real-world networks in single-label and multi-label node classification tasks.
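The abstract does not spell out PAW's acceptance rule or its two stationary distributions, so the following is only a minimal sketch of the general idea it describes: a random walk whose steps are probabilistically accepted so that the walk follows a chosen stationary distribution. It uses a standard Metropolis-Hastings walk with a uniform-neighbor proposal, which is a substitute technique and not the authors' method. The toy graph, the `target` distribution, and the function names `acceptance_probability` and `guided_walk` are all hypothetical.

```python
import random


def acceptance_probability(graph, target, u, v):
    """Metropolis-Hastings acceptance for a proposed move u -> v.

    Assumes the proposal picks a neighbor of u uniformly at random, so the
    proposal probability is 1 / deg(u). This is a generic rule, not PAW's.
    """
    ratio = (target[v] * len(graph[u])) / (target[u] * len(graph[v]))
    return min(1.0, ratio)


def guided_walk(graph, target, start, length, rng):
    """Sample a node path of `length` nodes guided by `target`.

    A rejected proposal keeps the walker on its current node, which is
    what makes `target` the stationary distribution of the walk.
    """
    walk = [start]
    current = start
    while len(walk) < length:
        candidate = rng.choice(graph[current])
        if rng.random() < acceptance_probability(graph, target, current, candidate):
            current = candidate
        walk.append(current)
    return walk


# Hypothetical undirected toy graph as adjacency lists.
graph = {
    "a": ["b", "c"],
    "b": ["a", "c"],
    "c": ["a", "b", "d"],
    "d": ["c"],
}
# Hypothetical target stationary distribution over nodes (sums to 1).
target = {"a": 0.1, "b": 0.2, "c": 0.3, "d": 0.4}

walk = guided_walk(graph, target, "a", 10, random.Random(0))
print(walk)
```

In a random walk based embedding pipeline, paths sampled this way would typically be passed to a skip-gram style model to produce node vectors; whether PAW does exactly this is not stated in the abstract.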
Pages: 14