Mining Frequent Patterns in Evolving Graphs

被引:19
作者
Aslay, Cigdem [1 ]
Nasir, Muhammad Anis Uddin [2 ]
Morales, Gianmarco De Francisci [3 ]
Gionis, Aristides [1 ]
机构
[1] Aalto Univ, Helsinki, Finland
[2] King Digital Entertainment Ltd, Dublin, Ireland
[3] ISI Fdn, Turin, Italy
来源
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2018年
基金
芬兰科学院; 欧盟地平线“2020”;
关键词
SUBGRAPH;
D O I
10.1145/3269206.3271772
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Given a labeled graph, the frequent-subgraph mining (FSM) problem asks to find all the k-vertex subgraphs that appear with frequency greater than a given threshold. FSM has numerous applications ranging from biology to network science, as it provides a compact summary of the characteristics of the graph. However, the task is challenging, even more so for evolving graphs due to the streaming nature of the input and the exponential time complexity of the problem. In this paper, we initiate the study of the approximate FSM problem in both incremental and fully-dynamic streaming settings, where arbitrary edges can be added or removed from the graph. For each streaming setting, we propose algorithms that can extract a high-quality approximation of the frequent k-vertex subgraphs for a given threshold, at any given time instance, with high probability. In contrast to the existing state-of-the-art solutions that require iterating over the entire set of subgraphs for any update, our algorithms operate by maintaining a uniform sample of k-vertex subgraphs with optimized neighborhood-exploration procedures local to the updates. We provide theoretical analysis of the proposed algorithms and empirically demonstrate that the proposed algorithms generate high-quality results compared to baselines.
引用
收藏
页码:923 / 932
页数:10
相关论文
共 41 条
[1]   Incremental Frequent Subgraph Mining on Large Evolving Graphs [J].
Abdelhamid, Ehab ;
Canim, Mustafa ;
Sadoghi, Mohammad ;
Bhattacharjee, Bishwaranjan ;
Chang, Yuan-Chi ;
Kalnis, Panos .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (12) :2710-2723
[2]  
[Anonymous], 2001, TECHNICAL REPORT
[3]   GUISE: Uniform Sampling of Graphlets for Large Graph Analysis [J].
Bhuiyan, Mansurul A. ;
Rahman, Mahmudur ;
Rahman, Mahmuda ;
Al Hasan, Mohammad .
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012), 2012, :91-100
[4]  
Bifet Albert., 2011, P 17 ACM SIGKDD INT, P591, DOI DOI 10.1145/2020408.2020501
[5]  
Borgwardt KM, 2006, IEEE DATA MINING, P818
[6]   Counting Graphlets: Space vs Time [J].
Bressan, Marco ;
Chierichetti, Flavio ;
Kumar, Ravi ;
Leucci, Stefano ;
Panconesi, Alessandro .
WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, :557-566
[7]  
Chaoji Vineet, 2008, SADM, V1, P67, DOI DOI 10.1002/SAM.10004
[8]  
Chen Chen, 2007, ICDM
[9]  
Chen X., 2017, ASONAM, P131
[10]  
Chen XW, 2016, PROC VLDB ENDOW, V10, P253