Scalable probabilistic truss decomposition using central limit theorem and H-index

被引:2
作者
Esfahani, Fatemeh [1 ]
Daneshmand, Mahsa [1 ]
Srinivasan, Venkatesh [1 ]
Thomo, Alex [1 ]
Wu, Kui [1 ]
机构
[1] Univ Victoria, Victoria, BC, Canada
关键词
Truss decomposition; Dense subgraphs; Probabilistic graphs;
D O I
10.1007/s10619-022-07415-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Truss decomposition is a popular notion of hierarchical dense substructures in graphs. In a nutshell, k-truss is the largest subgraph in which every edge is contained in at least k triangles. Truss decomposition aims to compute k-trusses for each possible value of k. There are many works that study truss decomposition in deterministic graphs. However, in probabilistic graphs, truss decomposition is significantly more challenging and has received much less attention; state-of-the-art approaches do not scale well to large probabilistic graphs. Finding the tail probabilities of the number of triangles that contain each edge is a critical challenge of those approaches. This is achieved using dynamic programming which has quadratic run-time and thus not scalable to real large networks which, quite commonly, can have edges contained in many triangles (in the millions). To address this challenge, we employ a special version of the Central Limit Theorem (CLT) to obtain the tail probabilities efficiently. Based on our CLT approach we propose a peeling algorithm for truss decomposition that scales to large probabilistic graphs and offers significant improvement over state-of-the-art. We also design a second method which progressively tightens the estimate of the truss value of each edge and is based on h-index computation. In contrast to our CLT-based approach, our h-index algorithm (1) is progressive by allowing the user to see near-results along the way, (2) does not sacrifice the exactness of final result, and (3) achieves all these while processing only one edge and its immediate neighbors at a time, thus resulting in smaller memory footprint. We perform extensive experiments to show the scalability of both of our proposed algorithms.
引用
收藏
页码:299 / 333
页数:35
相关论文
共 33 条
  • [21] Local Algorithms for Hierarchical Dense Subgraph Discovery
    Sariyuce, Ahmet Erdem
    Seshadhri, C.
    Pinar, Ali
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2018, 12 (01): : 43 - 56
  • [22] Truss Decomposition in Massive Networks
    Wang, Jia
    Cheng, James
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (09): : 812 - 823
  • [23] Wu J, 2018, 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), P873, DOI 10.1109/ASONAM.2018.8508642
  • [24] Yuan Y, 2011, PROC VLDB ENDOW, V4, P876
  • [25] Efficient Keyword Search on Uncertain Graph Data
    Yuan, Ye
    Wang, Guoren
    Chen, Lei
    Wang, Haixun
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2013, 25 (12) : 2767 - 2779
  • [26] Yuxuan Xing, 2017, Security, Privacy and Anonymity in Computation, Communication and Storage, SpaCCS 2017: International Workshops. Proceedings: LNCS 10658, P719, DOI 10.1007/978-3-319-72395-2_64
  • [27] Extracting Analyzing and Visualizing Triangle K-Core Motifs within Networks
    Zhang, Yang
    Parthasarathy, Srinivasan
    [J]. 2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1049 - 1060
  • [28] Large Scale Cohesive Subgraphs Discovery for Social Network Visual Analysis
    Zhao, Feng
    Tung, Anthony K. H.
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 6 (02): : 85 - 96
  • [29] Zhou Yan-jiao, 2019, Nature Communications, V10, DOI [10.1038/s41467-019-12821-2, 10.1038/s41467-019-13698-x]
  • [30] Truss decomposition of uncertain graphs
    Zou, Zhaonian
    Zhu, Rong
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (01) : 197 - 230