Graph InfoClust: Maximizing Coarse-Grain Mutual Information in Graphs

被引:46
作者
Mavromatis, Costas [1 ]
Karypis, George [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I | 2021年 / 12712卷
关键词
D O I
10.1007/978-3-030-75762-5_43
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work proposes a new unsupervised (or self-supervised) node representation learning method that aims to leverage the coarse-grain information that is available in most graphs. This extends previous attempts that only leverage fine-grain information (similarities within local neighborhoods) or global graph information (similarities across all nodes). Intuitively, the proposed method identifies nodes that belong to the same clusters and maximizes their mutual information. Thus, coarse-grain (cluster-level) similarities that are shared between nodes are preserved in their representations. The core components of the proposed method are (i) a jointly optimized clustering of nodes during learning and (ii) an Infomax objective term that preserves the mutual information among nodes of the same clusters. Our method is able to outperform competing state-of-art methods in various downstream tasks, such as node classification, link prediction, and node clustering. Experiments show that the average gain is between 0.2% and 6.1%, over the best competing approach, over all tasks. Our code is publicly available at: https://github.com/cmavro/Graph-InfoClust-GIC.
引用
收藏
页码:541 / 553
页数:13
相关论文
共 25 条
[1]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[2]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[3]  
Glorot X., 2010, Proceedings of the thirteenth international conference on artificial intelligence and statistics, P249, DOI DOI 10.1109/LGRS.2016.2565705
[4]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864
[5]  
Hamilton WL, 2017, ADV NEUR IN, V30
[6]  
Hassani K, 2020, PR MACH LEARN RES, V119
[7]  
Hjelm R. D., 2019, PROC INT C LEARN REP, P24
[8]  
Kingma DP, 2014, ADV NEUR IN, V27
[9]  
Kipf TN, 2016, ARXIV
[10]  
Kipf TN, 2016, NIPS WORKSH BAYES DE