An Information-Explainable Random Walk Based Unsupervised Network Representation Learning Framework on Node Classification Tasks

被引：0

作者：

Xu, Xin ^{[1
]}

Lu, Yang ^{[1
]}

Zhou, Yupeng ^{[1
]}

Fu, Zhiguo ^{[1
]}

Fu, Yanjie ^{[2
]}

Yin, Minghao ^{[1
]}

机构：

[1] Northeast Normal Univ, Coll Informat Sci & Technol, Dept Comp Sci, Changchun 130117, Peoples R China

[2] Univ Cent Florida, Coll Engn & Comp Sci, Dept Comp Sci, Orlando, FL 32816 USA

来源：

MATHEMATICS | 2021年 / 9卷 / 15期

关键词：

network representation learning; random walk; stationary distributions; unsupervised learning; network embedding; PREDICTION;

D O I：

10.3390/math9151767

中图分类号：

O1 [数学];

学科分类号：

0701 ; 070101 ;

摘要：

Network representation learning aims to learn low-dimensional, compressible, and distributed representational vectors of nodes in networks. Due to the expensive costs of obtaining label information of nodes in networks, many unsupervised network representation learning methods have been proposed, where random walk strategy is one of the wildly utilized approaches. However, the existing random walk based methods have some challenges, including: 1. The insufficiency of explaining what network knowledge in the walking path-samplings; 2. The adverse effects caused by the mixture of different information in networks; 3. The poor generality of the methods with hyper-parameters on different networks. This paper proposes an information-explainable random walk based unsupervised network representation learning framework named Probabilistic Accepted Walk (PAW) to obtain network representation from the perspective of the stationary distribution of networks. In the framework, we design two stationary distributions based on nodes' self-information and local-information of networks to guide our proposed random walk strategy to learn representational vectors of networks through sampling paths of nodes. Numerous experimental results demonstrated that the PAW could obtain more expressive representation than the other six widely used unsupervised network representation learning baselines on four real-world networks in single-label and multi-label node classification tasks.

引用

页数：14

共 28 条

[21] The Graph Neural Network Model
Scarselli, Franco
Gori, Marco
Tsoi, Ah Chung
Hagenbuchner, Markus
Monfardini, Gabriele
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (01): : 61 - 80
[22] The BioGRID Interaction Database: 2011 update
Stark, Chris
Breitkreutz, Bobby-Joe
Chatr-aryamontri, Andrew
Boucher, Lorrie
Oughtred, Rose
Livstone, Michael S.
Nixon, Julie
Van Auken, Kimberly
Wang, Xiaodong
Shi, Xiaoqi
Reguly, Teresa
Rust, Jennifer M.
Winter, Andrew
Dolinski, Kara
Tyers, Mike
[J]. NUCLEIC ACIDS RESEARCH, 2011, 39 : D698 - D704
[23] LINE: Large-scale Information Network Embedding
Tang, Jian
Qu, Meng
Wang, Mingzhe
Zhang, Ming
Yan, Jun
Mei, Qiaozhu
[J]. PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 1067 - 1077
[24] Tang L., 2009, KDD, P817
[25] Probabilistic principal component analysis
Tipping, ME
Bishop, CM
[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 : 611 - 622
[26] Social structure of Facebook networks
Traud, Amanda L.
Mucha, Peter J.
Porter, Mason A.
[J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2012, 391 (16) : 4165 - 4180
[27] Structural Deep Network Embedding
Wang, Daixin
Cui, Peng
Zhu, Wenwu
[J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1225 - 1234
[28] Xu X, P 2018 IEEE INT C DA, P647

← 1 2 3 →