Word and graph attention networks for semi-supervised classification

被引:9
作者
Zhang, Jing [1 ]
Li, Mengxi [1 ]
Gao, Kaisheng [1 ]
Meng, Shunmei [1 ]
Zhou, Cangqi [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
基金
中国国家自然科学基金;
关键词
Graph neural networks; Graph embedding; Graph attention; Word attention; Semi-supervised classification;
D O I
10.1007/s10115-021-01610-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph attention networks are effective graph neural networks that perform graph embedding for semi-supervised learning, which considers the neighbors of a node when learning its features. This paper presents a novel attention-based graph neural network that introduces an attention mechanism in the word-represented features of a node together incorporating the neighbors' attention in the embedding process. Instead of using a vector as the feature of a node in the traditional graph attention networks, the proposed method uses a 2D matrix to represent a node, where each row in the matrix stands for a different attention distribution against the original word-represented features of a node. Then, the compressed features are fed into a graph attention layer that aggregates the matrix representation of the node and its neighbor nodes with different attention weights as a new representation. By stacking several graph attention layers, it obtains the final representation of nodes as matrices, which considers both that the neighbors of a node have different importance and that the words also have different importance in their original features. Experimental results on three citation network datasets show that the proposed method significantly outperforms eight state-of-the-art methods in semi-supervised classification tasks.
引用
收藏
页码:2841 / 2859
页数:19
相关论文
共 42 条
[1]  
Ahmed Amr, 2013, WWW, P37
[2]  
Ambartsoumian A, 2018, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], 2014, P SSST 8 8 WORKSH SY
[4]  
[Anonymous], 2016, P 25 INT JOINT C ART
[5]  
[Anonymous], 2013, 2 INT C LEARN REPR I
[6]  
[Anonymous], Food Additives Contaminants
[7]  
[Anonymous], 2013, COMPUTING RES REPOSI
[8]  
Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
[9]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[10]   Higher-order organization of complex networks [J].
Benson, Austin R. ;
Gleich, David F. ;
Leskovec, Jure .
SCIENCE, 2016, 353 (6295) :163-166