Mining Diversified Top-r Lasting Cohesive Subgraphs on Temporal Networks

被引:10
作者
Lin, Longlong [1 ]
Yuan, Pingpeng [1 ]
Li, Ronghua [2 ]
Jin, Hai [1 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol & Syst, Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab,Sch Comp Sci & Technol, Wuhan 430074, Hubei, Peoples R China
[2] Beijing Inst Technol, Dept Comp Sci, Beijing 100081, Peoples R China
关键词
Cohesive subgraphs; temporal networks; lasting pattern mining; diversified top-r search; SEARCH; GRAPHS;
D O I
10.1109/TBDATA.2021.3058294
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Temporal subgraph mining is recently ubiquitous. Identifying diversified and lasting ingredients is a fundamental problem in analyzing temporal networks. In this paper, we investigate the problem of finding diversified lasting cohesive subgraphs from temporal networks. Specifically, we first introduce a new model, called maximal lasting (k, sigma)-core, for characterizing lasting cohesive subgraphs on temporal networks so as to the nodes in the subgraph are connected densely and also the subgraph's structure remains unchanged for a period of time. To enhance the diversity of results, we then formulate a diversified lasting cohesive subgraphs problem, which finds r maximal lasting (k, sigma)-cores with maximum coverage regarding the number of vertices and timestamps. Unfortunately, we show that the optimization problem is NP-hard. To tackle this issue, we first devise a greedy algorithm named GreLC with (1-1/e) approximation ratio. However, GreLC has prohibitively high time and space complexity, resulting in poor scalability. Then, an improved DFS-based search algorithm called TopLC with 1/4 approximation ratio is proposed to lower the computational cost. Finally, empirical studies on six real-world temporal networks demonstrate that the proposed solutions perform efficiently and accurately, and our model is better than temporal cohesive subgraphs detected by existing approaches.
引用
收藏
页码:1537 / 1549
页数:13
相关论文
共 47 条
[1]  
Agrawal R., 2009, P 2 ACM INT C WEB SE, P5, DOI 10.1145/1498759.1498766
[2]  
Ahmed R., 2011, Proceedings of the 2011 IEEE 11th International Conference on Data Mining (ICDM 2011), P1, DOI 10.1109/ICDM.2011.20
[3]  
Angel A., 2011, SIGMOD, P781
[4]   Online maximum k-coverage [J].
Ausiello, G. ;
Boria, N. ;
Giannakos, A. ;
Lucarelli, G. ;
Paschos, V. Th. .
DISCRETE APPLIED MATHEMATICS, 2012, 160 (13-14) :1901-1913
[5]  
Batagelj V., 2003, ARXIV
[6]   PREVENTING UNRAVELING IN SOCIAL NETWORKS: THE ANCHORED K-CORE PROBLEM [J].
Bhawalkar, Kshipra ;
Kleinberg, Jon ;
Lewi, Kevin ;
Roughgarden, Tim ;
Sharma, Aneesh .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 2015, 29 (03) :1452-1475
[7]   MiMAG: mining coherent subgraphs in multi-layer graphs with edge labels [J].
Boden, Brigitte ;
Guennemann, Stephan ;
Hoffmann, Holger ;
Seidl, Thomas .
KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (02) :417-446
[8]   Distance-generalized Core Decomposition [J].
Bonchi, Francesco ;
Khan, Arijit ;
Severini, Lorenzo .
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, :1006-1023
[9]   Efficient Maximum Clique Computation over Large Sparse Graphs [J].
Chang, Lijun .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :529-538
[10]   Cohesive Subgraph Computation over Large Sparse Graphs [J].
Chang, Lijun ;
Qin, Lu .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, :2068-2071