The impossibility of low-rank representations for triangle-rich complex networks

Cited by: 42

Authors:

Seshadhri, C. ^{[1]}

Sharma, Aneesh ^{[2]}

Stolman, Andrew ^{[1]}

Goel, Ashish ^{[3]}

Affiliations:

[1] Univ Calif Santa Cruz, Dept Comp Sci, Santa Cruz, CA 95064 USA

[2] Google, Mountain View, CA 94043 USA

[3] Stanford Univ, Dept Management Sci & Engn, Stanford, CA 94305 USA

Source:

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2020 / Vol. 117 / No. 11

Keywords:

graph embeddings; graph representations; low-dimensional embeddings; low-rank representations; singular value decomposition;

DOI:

10.1073/pnas.1911030117

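Chinese Library Classification (CLC):

O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];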

Discipline classification codes:

07 ; 0710 ; 09 ;

Abstract:

The study of complex networks is a significant development in modern science, and has enriched the social sciences, biology, physics, and computer science. Models and algorithms for such networks are pervasive in our society, and impact human behavior via social networks, search engines, and recommender systems, to name a few. A widely used algorithmic technique for modeling such complex networks is to construct a low-dimensional Euclidean embedding of the vertices of the network, where proximity of vertices is interpreted as the likelihood of an edge. Contrary to the common view, we argue that such graph embeddings do not capture salient properties of complex networks. The two properties we focus on are low degree and large clustering coefficients, which have been widely established to be empirically true for real-world networks. We mathematically prove that any embedding (that uses dot products to measure similarity) that can successfully create these two properties must have a rank that is nearly linear in the number of vertices. Among other implications, this establishes that popular embedding techniques such as singular value decomposition and node2vec fail to capture significant structural aspects of real-world complex networks. Furthermore, we empirically study a number of different embedding techniques based on dot product, and show that they all fail to capture the triangle structure.

Citation

Pages: 5631-5637

Page count: 7

31 references in total

[11] FIEDLER M, 1973, CZECH MATH J, V23, P298
[12] Grover, Aditya; Leskovec, Jure. node2vec: Scalable Feature Learning for Networks [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016: 855-864
[13] Gupta Pankaj, 2013, P 22 INT C WORLD WID, P505, DOI 10.1145/2488388.2488433
[14] Hamilton W. L., 2017, ADV NEURAL INFORM PR, P1025, DOI 10.48550/ARXIV.1706.02216
[15] Hoff, PD; Raftery, AE; Handcock, MS. Latent space approaches to social network analysis [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (460): 1090-1098
[16] HOLLAND, PW; LASKEY, KB; LEINHARDT, S. STOCHASTIC BLOCKMODELS - 1ST STEPS [J]. SOCIAL NETWORKS, 1983, 5 (02): 109-137
[17] Kloumann, Isabel M.; Ugander, Johan; Kleinberg, Jon. Block models and personalized PageRank [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (01): 33-38
[18] Leskovec J., Stanford Network Analysis Project
[19] Newman, MEJ. The structure and function of complex networks [J]. SIAM REVIEW, 2003, 45 (02): 167-256
[20] Perozzi Bryan, 2014, P 20 ACM SIGKDD INT, DOI 10.1145/2623330.2623732, arXiv: 1403.6652
