On the Equivalence of Decoupled Graph Convolution Network and Label Propagation

被引：58

作者：

Dong, Hande ^{[1
]}

Chen, Jiawei ^{[1
]}

Feng, Fuli ^{[2
]}

He, Xiangnan ^{[1
]}

Bi, Shuxian ^{[1
]}

Ding, Zhaolin ^{[3
]}

Cui, Peng ^{[4
]}

机构：

[1] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

[2] Natl Univ Singapore, Singapore, Singapore

[3] North Carolina State Univ, Raleigh, NC USA

[4] Tsinghua Univ, Beijing, Peoples R China

来源：

PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

Graph Convolution Network; Graph Neural Networks; Decoupled Graph Neural Network; Label Propagation;

D O I：

10.1145/3442381.3449927

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The original design of Graph Convolution Network (GCN) couples feature transformation and neighborhood aggregation for node representation learning. Recently, some work shows that coupling is inferior to decoupling, which supports deep graph propagation better and has become the latest paradigm of GCN (e.g., APPNP [16] and SGCN [32]). Despite effectiveness, the working mechanisms of the decoupled GCN are not well understood. In this paper, we explore the decoupled GCN for semi-supervised node classification from a novel and fundamental perspective label propagation. We conduct thorough theoretical analyses, proving that the decoupled GCN is essentially the same as the two-step label propagation: first, propagating the known labels along the graph to generate pseudo-labels for the unlabeled nodes, and second, training normal neural network classifiers on the augmented pseudo-labeled data. More interestingly, we reveal the effectiveness of decoupled GCN: going beyond the conventional label propagation, it could automatically assign structure- and model- aware weights to the pseudo-label data. This explains why the decoupled GCN is relatively robust to the structure noise and over-smoothing, but sensitive to the label noise and model initialization. Based on this insight, we propose a new label propagation method named Propagation then Training Adaptively (PTA), which overcomes the flaws of the decoupled GCN with a dynamic and adaptive weighting strategy. Our PTA is simple yet more effective and robust than decoupled GCN. We empirically validate our findings on four benchmark datasets, demonstrating the advantages of our method. The code is available at https://github.com/DongHande/PT_propagation_then_training.

引用

页码：3651 / 3662

页数：12

共 45 条

[1]

Abu-El-Haija S., 2019, C UNC ART INT UAI, V115, P841

[2]

Brin Sergey, 1998, Technical report, V98, P161

[3]

Bruna Joan, 2014, P INT C LEARN REPR

[4] A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications [J].

Cai, HongYun ;

Zheng, Vincent W. ;

Chang, Kevin Chen-Chuan .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) :1616-1637

[5]

Chen DL, 2020, AAAI CONF ARTIF INTE, V34, P3438

[6]

Chen WJ, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P2116

[7]

Defferrard M, 2016, ADV NEUR IN, V29

[8]

Frasca F., 2020, ICML 2020 WORKSH GRA, P1

[9]

Garg VK, 2020, PR MACH LEARN RES, V119

[10]

Hamilton WL, 2017, ADV NEUR IN, V30

← 1 2 3 4 5 →