Node Embedding with Adaptive Similarities for Scalable Learning over Graphs

被引：6

作者：

Berberidis, Dimitris ^{[1
,2
]}

Giannakis, Georgios B. ^{[1
,2
]}

机构：

[1] Univ Minnesota, Dept Elect & Comp Engn, Minneapolis, MN 55455 USA

[2] Univ Minnesota, Digital Technol Ctr, Minneapolis, MN 55455 USA

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2021年 / 33卷 / 02期

关键词：

SVD; SVM; unsupervised; multiscale; random walks; spectral; NETWORKS;

D O I：

10.1109/TKDE.2019.2931542

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Node embedding is the task of extracting informative and descriptive features over the nodes of a graph. The importance of node embedding for graph analytics as well as learning tasks, such as node classification, link prediction, and community detection, has led to a growing interest and a number of recent advances. Nonetheless, node embedding faces several major challenges. Practical embedding methods have to deal with real-world graphs that arise from different domains, with inherently diverse underlying processes as well as similarity structures and metrics. On the other hand, similar to principal component analysis in feature vector spaces, node embedding is an inherently unsupervised task. Lacking metadata for validation, practical schemes motivate standardization and limited use of tunable hyperparameters. Finally, node embedding methods must be scalable in order to cope with large-scale real-world graphs of networks with ever-increasing size. The present work puts forth an adaptive node embedding framework that adjusts the embedding process to a given underlying graph, in a fully unsupervised manner. This is achieved by leveraging the notion of a tunable node similarity matrix that assigns weights on multihop paths. The design of multihop similarities ensures that the resultant embeddings also inherit interpretable spectral properties. The proposed model is thoroughly investigated, interpreted, and numerically evaluated using stochastic block models. Moreover, an unsupervised algorithm is developed for training the model parameters effieciently. Extensive node classification, link prediction, and clustering experiments are carried out on many real-world graphs from various domains, along with comparisons with state-of-the-art scalable and unsupervised node embedding alternatives. The proposed method enjoys superior performance in many cases, while also yielding interpretable information on the underlying graph structure.

引用

页码：637 / 650

页数：14

共 48 条

[1]

Abu-El-Haija S., 2017, ARXIV171009599

[2] Friends and neighbors on the Web [J].

Adamic, LA ;

Adar, E .

SOCIAL NETWORKS, 2003, 25 (03) :211-230

[3]

Ahmed A., 2013, Proceedings of the 22nd International Conference on World Wide Web, P37

[4]

Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027

[5]

Balasubramanian M, 2002, SCIENCE, V295

[6]

Barker B, 2015, WORKSH HIGH PERF COM, V256

[7] Adaptive Diffusions for Scalable Learning Over Graphs [J].

Berberidis, Dimitris ;

Nikolakopoulos, Athanasios N. ;

Giannakis, Georgios B. .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2019, 67 (05) :1307-1321

[8]

Bertsekas D., 1999, Nonlinear Programming

[9]

Bordes A, 2013, P 27 INT C NEUR INF, V2, P2787

[10]

Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401

← 1 2 3 4 5 →