On the Equivalence of Decoupled Graph Convolution Network and Label Propagation

被引:44
作者
Dong, Hande [1 ]
Chen, Jiawei [1 ]
Feng, Fuli [2 ]
He, Xiangnan [1 ]
Bi, Shuxian [1 ]
Ding, Zhaolin [3 ]
Cui, Peng [4 ]
机构
[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[2] Natl Univ Singapore, Singapore, Singapore
[3] North Carolina State Univ, Raleigh, NC USA
[4] Tsinghua Univ, Beijing, Peoples R China
来源
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年
基金
中国国家自然科学基金;
关键词
Graph Convolution Network; Graph Neural Networks; Decoupled Graph Neural Network; Label Propagation;
D O I
10.1145/3442381.3449927
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The original design of Graph Convolution Network (GCN) couples feature transformation and neighborhood aggregation for node representation learning. Recently, some work shows that coupling is inferior to decoupling, which supports deep graph propagation better and has become the latest paradigm of GCN (e.g., APPNP [16] and SGCN [32]). Despite effectiveness, the working mechanisms of the decoupled GCN are not well understood. In this paper, we explore the decoupled GCN for semi-supervised node classification from a novel and fundamental perspective label propagation. We conduct thorough theoretical analyses, proving that the decoupled GCN is essentially the same as the two-step label propagation: first, propagating the known labels along the graph to generate pseudo-labels for the unlabeled nodes, and second, training normal neural network classifiers on the augmented pseudo-labeled data. More interestingly, we reveal the effectiveness of decoupled GCN: going beyond the conventional label propagation, it could automatically assign structure- and model- aware weights to the pseudo-label data. This explains why the decoupled GCN is relatively robust to the structure noise and over-smoothing, but sensitive to the label noise and model initialization. Based on this insight, we propose a new label propagation method named Propagation then Training Adaptively (PTA), which overcomes the flaws of the decoupled GCN with a dynamic and adaptive weighting strategy. Our PTA is simple yet more effective and robust than decoupled GCN. We empirically validate our findings on four benchmark datasets, demonstrating the advantages of our method. The code is available at https://github.com/DongHande/PT_propagation_then_training.
引用
收藏
页码:3651 / 3662
页数:12
相关论文
共 45 条
  • [1] Abu-El-Haija S, 2019, UAI, P1
  • [2] Bruna J., 2014, C TRACK P
  • [3] A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications
    Cai, HongYun
    Zheng, Vincent W.
    Chang, Kevin Chen-Chuan
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) : 1616 - 1637
  • [4] Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438
  • [5] Chen WJ, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2116
  • [6] Defferrard M, 2016, ADV NEUR IN, V29
  • [7] Frasca Fabrizio, 2020, ICML 2020 WORKSH GRA
  • [8] Garg VK, 2020, PR MACH LEARN RES, V119
  • [9] Hamilton WL, 2017, ADV NEUR IN, V30
  • [10] He K., 2015, ARXIV, p1505.00996, DOI 10.48550/arXiv