The Input/Output Complexity of Triangle Enumeration

被引:30
作者
Pagh, Rasmus [1 ]
Silvestri, Francesco [1 ,2 ]
机构
[1] IT Univ Copenhagen, Copenhagen, Denmark
[2] Univ Padua, Dept Informat Engn, Padua, Italy
来源
PODS'14: PROCEEDINGS OF THE 33RD ACM SIGMOD-SIGACT-SIGART SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS | 2014年
基金
新加坡国家研究基金会;
关键词
Triangle listing; external memory; cache-oblivious; cache-aware; lower bound;
D O I
10.1145/2594538.2594552
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the well-known problem of enumerating all triangles of an undirected graph. Our focus is on determining the input/output (I/O) complexity of this problem. Let E be the number of edges, M < E the size of internal memory, and B the block size. The best results obtained previously are sort(E-3/2) I/Os (Dementiev, PhD thesis 2006) and O (E-2/(MB)) I/Os (Hu et al., SIGMOD 2013), where sort(n) denotes the number of I/Os for sorting n items. We improve the I/O complexity to O (E-3/2/(root MB)) expected I/Os, which improves the previous bounds by a factor min(root E/M, root M). Our algorithm is cache-oblivious and also I/O optimal: We show that any algorithm enumer- ating t distinct triangles must always use Omega (t/root MB) I/Os, and there are graphs for which t = Omega (E-3/2). Finally, we give a deterministic cache-aware algorithm using O (E-3/2/( root MB)) I/Os assuming M >= E-epsilon for a constant( )epsilon > 0. Our results are based on a new color coding technique, which may be of independent interest.
引用
收藏
页码:224 / 233
页数:10
相关论文
共 25 条
  • [1] Afrati FN, 2013, PROC VLDB ENDOW, V6, P277
  • [2] THE INPUT OUTPUT COMPLEXITY OF SORTING AND RELATED PROBLEMS
    AGGARWAL, A
    VITTER, JS
    [J]. COMMUNICATIONS OF THE ACM, 1988, 31 (09) : 1116 - 1127
  • [3] SIMPLE CONSTRUCTIONS OF ALMOST K-WISE INDEPENDENT RANDOM-VARIABLES
    ALON, N
    GOLDREICH, O
    HASTAD, J
    PERALTA, R
    [J]. RANDOM STRUCTURES & ALGORITHMS, 1992, 3 (03) : 289 - 304
  • [4] [Anonymous], 2011, P INT C WORLD WID WE, DOI DOI 10.1145/1963405.1963491
  • [5] Arge L., 1999, External Memory Algorithms. DIMACS Workshop External Memory Algorithms and Visualization, P139
  • [6] Berry Jonathan, 2014, P 5 INN THEOR COMP S
  • [7] Tolerating the community detection resolution limit with edge weighting
    Berry, Jonathan W.
    Hendrickson, Bruce
    LaViolette, Randall A.
    Phillips, Cynthia A.
    [J]. PHYSICAL REVIEW E, 2011, 83 (05)
  • [8] Brodal GerthStolting., 2003, STOC 03, P307, DOI 10.1145/780542.780589
  • [9] Triangle Listing in Massive Networks
    Chu, Shumo
    Cheng, James
    [J]. ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2012, 6 (04)
  • [10] Dementiev Roman, 2007, THESIS