Triangle and Four Cycle Counting in the Data Stream Model

被引:5
作者
McGregor, Andrew [1 ]
Vorotnikova, Sofya [2 ]
机构
[1] Univ Massachusetts, Amherst, MA 01003 USA
[2] Dartmouth Coll, Hanover, NH 03755 USA
来源
PODS'20: PROCEEDINGS OF THE 39TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS | 2020年
关键词
Data streams; triangles; cycles; GRAPH; ALGORITHMS; COMPLEXITY; 2ND;
D O I
10.1145/3375395.3387652
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of estimating the number of cycles in a graph is one of the most widely studied graph problems in the data stream model. Three relevant variants of the data stream model include: the arbitrary order model in which the stream consists of the edges of the graph in arbitrary order, the random order model in which the edges are randomly permuted, and the adjacency list order model in which all edges incident to the same vertex appear consecutively. In this paper, we focus on the problem of triangle and four-cycle counting in these models. We improve over the state-of-the-art results as follows, where n is the number of vertices, m is the number of edges and T is the number of triangles/four-cycles in the graph (i.e., the quantity being estimated): Random Order Model: We present a single-pass algorithm that (1 + epsilon)-approximates the number of triangles using (O) over tilde (c(-2)m/root T) space and prove that this is optimal in the range T <= root m. The best previous result, a (3 + epsilon)-approximation using (O) over tilde(epsilon(-4.5)m/root T) space, was presented by Cormode and Jowhari (Theor. Comput. Sci. 2017). Adjacency List Model: We present an algorithm that returns a (1 + epsilon)-approximation of the number of 4-cycles using two passes and (O) over tilde(epsilon(-4)m/root T) space. The best previous result, a constant approximation using e O(m/T 3/8) space, was presented by Kallaugher et al. (PODS 2019). We also show that (1 +epsilon)-approximation in a single pass is possible in a) polylog( n) space if T = Omega(n(2)) and b) (O) over tilde (n) space if T = Omega(n). Arbitrary Order Model: We present a three-pass algorithm that (1 + epsilon)-approximates the number of 4-cycles using e O(.-2m/ T 1/4) space and a one-pass algorithm that uses (O) over tilde(epsilon(-2)n) space when T = (O) over tilde (n(2)). The best existing result, a (1 + epsilon)-approximation using (O) over tilde(epsilon(-2)m(2) /T) space, was presented by Bera and Chakrabarti (STACS 2017). We also show a multi-pass lower bound and another algorithm for distinguishing graphs with no four cycles and graphs with many 4-cycles.
引用
收藏
页码:445 / 456
页数:12
相关论文
共 33 条
[1]   On Dense Pattern Mining in Graph Streams [J].
Aggarwal, Charu C. ;
Li, Yao ;
Yu, Philip S. ;
Jin, Ruoming .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :975-984
[2]   The space complexity of approximating the frequency moments [J].
Alon, N ;
Matias, Y ;
Szegedy, M .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 58 (01) :137-147
[3]  
[Anonymous], 2010, SODA
[4]  
[Anonymous], 2012, P 31 ACM SIGMOD SIGA, DOI [DOI 10.1145/2213556.2213560, 10.1145/2213556.2213560.13]
[5]  
[Anonymous], 2011, P 30 ACM SIGMOD SIGA, DOI DOI 10.1145/1989284.1989289
[6]   Mining Frequent Patterns in Evolving Graphs [J].
Aslay, Cigdem ;
Nasir, Muhammad Anis Uddin ;
Morales, Gianmarco De Francisci ;
Gionis, Aristides .
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, :923-932
[7]  
Bar-Yossef Z, 2002, SIAM PROC S, P623
[8]   Towards Tighter Space Bounds for Counting Triangles and Other Substructures in Graph Streams [J].
Bera, Suman K. ;
Chakrabarti, Amit .
34TH SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2017), 2017, 66
[9]  
Braverman V, 2013, LECT NOTES COMPUT SC, V7965, P244, DOI 10.1007/978-3-642-39206-1_21
[10]   Triangle Counting in Dynamic Graph Streams [J].
Bulteau, Laurent ;
Froese, Vincent ;
Kutzkov, Konstantin ;
Pagh, Rasmus .
ALGORITHMICA, 2016, 76 (01) :259-278