Data Augmentation for Graph Convolutional Network on Semi-supervised Classification

Cited by: 6
Authors
Tang, Zhengzheng [1, 2]
Qiao, Ziyue [1, 2]
Hong, Xuehai [1, 3]
Wang, Yang [2]
Dharejo, Fayaz Ali [1, 2]
Zhou, Yuanchun [2]
Du, Yi [2]
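Affiliations
[1] Chinese Academy of Sciences, Computer Network Information Center, Beijing, People's Republic of China
[2] University of Chinese Academy of Sciences, Beijing, People's Republic of China
[3] Chinese Academy of Sciences, Institute of Computing Technology, Beijing, People's Republic of China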
Source
WEB AND BIG DATA, APWEB-WAIM 2021, PT II | 2021 / Vol. 12859
Funding
Beijing Natural Science Foundation
Keywords
Data augmentation; Graph Convolutional Network; Semi-supervised classification;
DOI
10.1007/978-3-030-85899-5_3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Data augmentation generates new, synthetic features from the original data, which can yield a better data representation and improve performance and generalizability on downstream tasks. However, data augmentation for graph-based models remains challenging because graph data is more complex than traditional data: it consists of two kinds of features with different properties, graph topology and node attributes. In this paper, we study graph data augmentation for the Graph Convolutional Network (GCN), with the goal of improving node embeddings for semi-supervised node classification. Specifically, we apply a cosine-similarity-based cross operation to the original features to create new graph features, including new node attributes and new graph topologies, and we combine them as new pairwise inputs for specific GCNs. We then propose an attentional integrating model that computes a weighted sum of the hidden node embeddings encoded by these GCNs to produce the final node embeddings. During training, we also impose a disparity constraint on these hidden node embeddings to ensure that non-redundant information is captured from the different features. Experimental results on five real-world datasets show that our method improves classification accuracy over the original GCN model by a clear margin (+2.5% to +84.2%).
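The abstract does not spell out the cross operation or the attention model, so the following is only a rough sketch of two generic building blocks that the description suggests, not the authors' implementation. It assumes (hypothetically) that a new topology can be obtained by linking each node to its k most cosine-similar nodes in attribute space, and that per-view embeddings are fused with softmax-normalized scores. The function names `cosine_similarity`, `knn_topology`, and `attention_fuse`, and the parameter `k`, are illustrative and do not come from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def knn_topology(features, k):
    """Assumed construction: symmetric 0/1 adjacency matrix that links each
    node to its k most cosine-similar nodes (ties broken by node index)."""
    n = len(features)
    adj = [[0] * n for _ in range(n)]
    for i in range(n):
        sims = [(cosine_similarity(features[i], features[j]), j)
                for j in range(n) if j != i]
        sims.sort(key=lambda pair: (-pair[0], pair[1]))
        for _, j in sims[:k]:
            adj[i][j] = 1
            adj[j][i] = 1  # keep the graph undirected
    return adj


def attention_fuse(embeddings, scores):
    """Assumed fusion: softmax the per-view scores, then return the weights
    and the weighted sum of the per-view embedding vectors of one node."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]  # shift for numerical stability
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(embeddings[0])
    fused = [sum(w * emb[d] for w, emb in zip(weights, embeddings))
             for d in range(dim)]
    return weights, fused


# Toy example: four nodes with 2-dimensional attributes.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
adjacency = knn_topology(features, k=1)

# Two hypothetical per-view embeddings of one node, with equal scores.
weights, fused = attention_fuse([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])
```

In the toy example, nodes 0 and 1 end up linked to each other, as do nodes 2 and 3, because their attribute vectors point in nearly the same direction. In a trained model the attention scores would be learned rather than fixed, and the disparity constraint mentioned in the abstract would be an additional training loss that this sketch does not cover.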
Pages: 33-48
Page count: 16