Motif-based spectral clustering of weighted directed networks

被引:11
作者
Underwood, William G. [1 ,2 ]
Elliott, Andrew [3 ,4 ]
Cucuringu, Mihai [1 ,3 ]
机构
[1] Univ Oxford, Dept Stat, 24-29 St Giles, Oxford OX1 3LB, England
[2] Princeton Univ, Dept Operat Res & Financial Engn, Sherrerd Hall,Charlton St, Princeton, NJ 08544 USA
[3] British Lib, Alan Turing Inst, 96 Euston Rd, London NW1 2DB, England
[4] Univ Glasgow, Sch Math & Stat, Glasgow GL12 8QQ, Lanark, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Motif; Spectral clustering; Weighted network; Directed network; Community detection; Graph Laplacian; Bipartite network; GRAPHS;
D O I
10.1007/s41109-020-00293-z
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Clustering is an essential technique for network analysis, with applications in a diverse range of fields. Although spectral clustering is a popular and effective method, it fails to consider higher-order structure and can perform poorly on directed networks. One approach is to capture and cluster higher-order structures using motif adjacency matrices. However, current formulations fail to take edge weights into account, and thus are somewhat limited when weight is a key component of the network under study.We address these shortcomings by exploring motif-based weighted spectral clustering methods. We present new and computationally useful matrix formulae for motif adjacency matrices on weighted networks, which can be used to construct efficient algorithms for any anchored or non-anchored motif on three nodes. In a very sparse regime, our proposed method can handle graphs with a million nodes and tens of millions of edges. We further use our framework to construct a motif-based approach for clustering bipartite networks.We provide comprehensive experimental results, demonstrating (i) the scalability of our approach, (ii) advantages of higher-order clustering on synthetic examples, and (iii) the effectiveness of our techniques on a variety of real world data sets; and compare against several techniques from the literature. We conclude that motif-based spectral clustering is a valuable tool for analysis of directed and bipartite weighted networks, which is also scalable and easy to implement.
引用
收藏
页数:41
相关论文
共 80 条
[1]  
Adamic L. A., 2005, P 3 INT WORKSH LINK, P36, DOI DOI 10.1145/1134271.1134277
[2]   Efficient Graphlet Counting for Large Networks [J].
Ahmed, Nesreen K. ;
Neville, Jennifer ;
Rossi, Ryan A. ;
Duffield, Nick .
2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2015, :1-10
[3]  
Aicher C, 2013, ADAPTING STOCHASTIC
[4]   Learning latent block structure in weighted networks [J].
Aicher, Christopher ;
Jacobs, Abigail Z. ;
Clauset, Aaron .
JOURNAL OF COMPLEX NETWORKS, 2015, 3 (02) :221-248
[5]   Scale-free networks in cell biology [J].
Albert, R .
JOURNAL OF CELL SCIENCE, 2005, 118 (21) :4947-4957
[6]  
[Anonymous], 1959, Publ. Math. Debr.
[7]  
Arthur D, 2007, PROCEEDINGS OF THE EIGHTEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P1027
[8]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[9]   Simplicial closure and higher-order link prediction [J].
Benson, Austin R. ;
Abebe, Rediet ;
Schaub, Michael T. ;
Jadbabaie, Ali ;
Kleinberg, Jon .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (48) :E11221-E11230
[10]   Higher-order organization of complex networks [J].
Benson, Austin R. ;
Gleich, David F. ;
Leskovec, Jure .
SCIENCE, 2016, 353 (6295) :163-166