Large-Scale Nodes Classification With Deep Aggregation Network

被引:8
作者
Li, Jiangtao [1 ]
Wu, Jianshe [1 ,2 ]
He, Weiquan [3 ]
Zhou, Peng [1 ]
机构
[1] Xidian Univ, Key Lab Intelligent Percept & Image Understanding, Int Res Ctr Intelligent Percept & Computat, Sch Artificial Intelligence,Minist Educ, Xian 710071, Shaanxi, Peoples R China
[2] China Elect Technol Grp Corp, Xidian Joint Lab Artificial Intelligence, Res Inst 20, Xian 710071, Shaanxi, Peoples R China
[3] Xidian Univ, Sch Elect Engn, Minist Educ, Video & Image Proc Syst Lab, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Task analysis; Optimization; Scalability; Machine learning; Convolution; Neural networks; Collaboration; Graph neural network; network representation learning; node embedding; node classification; semi-supervised learning;
D O I
10.1109/TKDE.2019.2955502
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most fundamental task of network representation learning (NRL) is nodes classification which requires an algorithm to map nodes to vectors and use machine learning models to predict nodes' labels. Recently, many methods based on neighborhood aggregation have achieved brilliant results in this task. However, the recursive expansion of neighborhood aggregation poses scalability and efficiency problems for deep models. Existing methods are limited to shallow architectures and cannot capture the high order proximity in networks. In this article, we propose the deep aggregation network (DAN). DAN uses a layer-wise greedy optimization strategy which stacks several sequential trained base models to form the final deep model. The high order neighborhood aggregation is performed in a dynamic programming manner, which allows the recursion nature of neighborhood aggregation to be eliminated. The reverse random walk is also proposed, and combined with the classic random walk in formulating a novel sampling strategy that allows DAN to flexibly adapt to different tasks related to communities or structural roles. DAN is more efficient and effective than previous neighborhood aggregation based methods, especially when it is intended to handle large-scale networks with dense connections. Extensive experiments are conducted on both synthetic and real-world networks to empirically demonstrate the effectiveness and efficiency of the proposed method.
引用
收藏
页码:2560 / 2572
页数:13
相关论文
共 39 条
[1]  
[Anonymous], 2016, ARXIV160908965
[2]   Scale-Free Networks: A Decade and Beyond [J].
Barabasi, Albert-Laszlo .
SCIENCE, 2009, 325 (5939) :412-413
[3]  
Bernardo J.M., 2009, Bayesian theory, V405
[4]   Geometric Deep Learning Going beyond Euclidean data [J].
Bronstein, Michael M. ;
Bruna, Joan ;
LeCun, Yann ;
Szlam, Arthur ;
Vandergheynst, Pierre .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (04) :18-42
[5]  
Bruna J., 2014, P 2 INT C LEARN REPR, P1, DOI 10.48550/arXiv.1312.6203
[6]   A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications [J].
Cai, HongYun ;
Zheng, Vincent W. ;
Chang, Kevin Chen-Chuan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) :1616-1637
[7]  
Chen HC, 2018, AAAI CONF ARTIF INTE, P2127
[8]  
Chen JF, 2018, PR MACH LEARN RES, V80
[9]   How Do the Open Source Communities Address Usability and UX Issues? An Exploratory Study [J].
Cheng, Jinghui ;
Guo, Jin L. C. .
CHI 2018: EXTENDED ABSTRACTS OF THE 2018 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2018,
[10]   Diffusion maps [J].
Coifman, Ronald R. ;
Lafon, Stephane .
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2006, 21 (01) :5-30