Learning High-Order Graph Convolutional Networks via Adaptive Layerwise Aggregation Combination

被引:8
作者
Zhang, Tianqi [1 ,2 ]
Wu, Qitian [1 ,2 ]
Yan, Junchi [1 ,2 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, AI Inst, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, AI Inst, Shanghai 200240, Peoples R China
关键词
Adaptive combination methods; graph neural networks (GNNs); graph representation learning; high-order graph convolutional networks;
D O I
10.1109/TNNLS.2021.3119958
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph convolutional networks have attracted wide attention for their expressiveness and empirical success on graph-structured data. However, deeper graph convolutional networks with access to more information can often perform worse because their low-order Chebyshev polynomial approximation cannot learn adaptive and structure-aware representations. To solve this problem, many high-order graph convolution schemes have been proposed. In this article, we study the reason why high-order schemes have the ability to learn structure-aware representations. We first prove that these high-order schemes are generalized Weisfeiler-Lehman (WL) algorithm and conduct spectral analysis on these schemes to show that they correspond to polynomial filters in the graph spectral domain. Based on our analysis, we point out twofold limitations of existing high-order models: 1) lack mechanisms to generate individual feature combinations for each node and 2) fail to properly model the relationship between information from different distances. To enable a node-specific combination scheme and capture this interdistance relationship for each node efficiently, we propose a new adaptive feature combination method inspired by the squeeze-and-excitation module that can recalibrate features from different distances by explicitly modeling interdependencies between them. Theoretical analysis shows that models with our new approach can effectively learn structure-aware representations, and extensive experimental results show that our new approach can achieve significant performance gain compared with other high-order schemes.
引用
收藏
页码:5144 / 5155
页数:12
相关论文
共 47 条
  • [1] Abu-El-Haija S, 2019, P C UNC ART INT UAI, P310
  • [2] Abu-El-Haija S., 2019, INT C MACH LEARN, P21
  • [3] Babai L., 1979, 20th Annual Symposium of Foundations of Computer Science, P39, DOI 10.1109/SFCS.1979.8
  • [4] Bahdanau D, 2016, Arxiv, DOI [arXiv:1409.0473, 10.48550/arXiv.1409.0473]
  • [5] Battaglia PW, 2016, ADV NEUR IN, V29
  • [6] Bruna Joan, 2014, C TRACK P
  • [7] AN OPTIMAL LOWER BOUND ON THE NUMBER OF VARIABLES FOR GRAPH IDENTIFICATION
    CAI, JY
    FURER, M
    IMMERMAN, N
    [J]. COMBINATORICA, 1992, 12 (04) : 389 - 410
  • [8] Chapelle A. Z. O, 2006, Semisupervised Learning
  • [9] Chen JF, 2018, PR MACH LEARN RES, V80
  • [10] Chen Ming, 2020, P MACHINE LEARNING R, V119