Centrality Measures for Graphons: Accounting for Uncertainty in Networks

被引:45
作者
Avella-Medina, Marco [1 ]
Parise, Francesca [2 ]
Schaub, Michael T. [3 ,4 ]
Segarra, Santiago [5 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
[2] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
[3] MIT, Inst Data Syst & Soc, Cambridge, MA 02139 USA
[4] Univ Oxford, Dept Engn Sci, Oxford OX1 2JD, England
[5] Rice Univ, Dept Elect & Comp Engn, Houston, TX 77005 USA
来源
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING | 2020年 / 7卷 / 01期
基金
欧盟地平线“2020”; 瑞士国家科学基金会;
关键词
Random graph theory; networks; graphons; centrality measures; stochastic block model; STOCHASTIC BLOCKMODELS; CONVERGENT SEQUENCES; MODELS; FREQUENCIES; STABILITY; ARRAYS; LIMITS;
D O I
10.1109/TNSE.2018.2884235
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
As relational datasets modeled as graphs keep increasing in size and their data-acquisition is permeated by uncertainty, graph-based analysis techniques can become computationally and conceptually challenging. In particular, node centrality measures rely on the assumption that the graph is perfectly known - a premise not necessarily fulfilled for large, uncertain networks. Accordingly, centrality measures may fail to faithfully extract the importance of nodes in the presence of uncertainty. To mitigate these problems, we suggest a statistical approach based on graphon theory: we introduce formal definitions of centrality measures for graphons and establish their connections to classical graph centrality measures. A key advantage of this approach is that centrality measures defined at the modeling level of graphons are inherently robust to stochastic variations of specific graph realizations. Using the theory of linear integral operators, we define degree, eigenvector, Katz and PageRank centrality functions for graphons and establish concentration inequalities demonstrating that graphon centrality functions arise naturally as limits of their counterparts defined on sequences of graphs of increasing size. The same concentration inequalities also provide high-probability bounds between the graphon centrality functions and the centrality measures on any sampled graph, thereby establishing a measure of uncertainty of the measured centrality score.
引用
收藏
页码:520 / 537
页数:18
相关论文
共 59 条
[1]   REPRESENTATIONS FOR PARTIALLY EXCHANGEABLE ARRAYS OF RANDOM-VARIABLES [J].
ALDOUS, DJ .
JOURNAL OF MULTIVARIATE ANALYSIS, 1981, 11 (04) :581-598
[2]  
[Anonymous], P 2017 ACM C EC COMP
[3]  
[Anonymous], 2013, Nonparametric graphon estimation
[4]  
[Anonymous], 2003, P ACM SIGKDD
[5]  
[Anonymous], 1979, PREPRINT
[6]  
[Anonymous], J MACH LEARN RES
[7]  
[Anonymous], ARXIV170910402
[8]  
[Anonymous], 2015, CLASS RANDOM GRAPHS
[9]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[10]  
[Anonymous], P INT C ART INT STAT