A Unified and Scalable Algorithm Framework of User-Defined Temporal (k, χ)-Core Query

被引:0
作者
Zhong, Ming [1 ]
Yang, Junyong [1 ]
Zhu, Yuanyuan [1 ]
Qian, Tieyun [1 ]
Liu, Mengchi [2 ]
Yu, Jeffrey Xu [3 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Hubei, Peoples R China
[2] South China Normal Univ, Guangzhou 510631, Guangdong, Peoples R China
[3] Chinese Univ Hong Kong, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
k-core; online algorithm; query processing; scalability; temporal graph; time interval; user-defined function; INFLUENTIAL COMMUNITY SEARCH; MOLECULAR NETWORKS; CORE DECOMPOSITION; PROTEIN NETWORKS;
D O I
10.1109/TKDE.2023.3349310
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Querying cohesive subgraphs on temporal graphs (e.g., social network, finance network, etc.) with various conditions has attracted intensive research interests recently. In this paper, we study a novel Temporal (k,chi)-Core Query (TXCQ) that extends a fundamental Temporal k-Core Query (TCQ) proposed in our conference paper by optimizing or constraining an arbitrary metric chi of k-core, such as size, engagement, interaction frequency, time span, burstiness, periodicity, etc. Our objective is to address specific TXCQ instances with conditions on different X in a unified algorithm framework that guarantees scalability. For that, this journal paper proposes a taxonomy of measurement chi(<middle dot>) and achieve our objective using a two-phase framework while chi(<middle dot>) is time-insensitive or time-monotonic. Specifically, Phase 1 still leverages the query processing algorithm of TCQ to induce all distinct k-cores during a given time range, and meanwhile locates the "time zones" in which the cores emerge. Then, Phase 2 conducts fast local search and chi evaluation in each time zone with respect to the time insensitivity or monotonicity of chi(<middle dot>). By revealing two insightful concepts named tightest time interval and loosest time interval that bound time zones, the redundant core induction and unnecessary chi evaluation in a zone can be reduced dramatically. Our experimental results demonstrate that TXCQ can be addressed as efficiently as TCQ, which achieves the latest state-of-the-art performance, by using a general algorithm framework that leaves chi(<middle dot>) as a user-defined function.
引用
收藏
页码:2831 / 2845
页数:15
相关论文
共 61 条
[1]  
[Anonymous], 2020, VLDB J., V29, P353
[2]  
[Anonymous], 2014, SNAP Datasets: Stanford large network dataset collection
[3]   Efficient temporal core maintenance of massive graphs [J].
Bai, Wen ;
Chen, Yadi ;
Wu, Di .
INFORMATION SCIENCES, 2020, 513 :324-340
[4]   Distance-generalized Core Decomposition [J].
Bonchi, Francesco ;
Khan, Arijit ;
Severini, Lorenzo .
SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, :1006-1023
[5]   Molecular networks: The top-down view [J].
Bray, D .
SCIENCE, 2003, 301 (5641) :1864-1865
[6]  
Chen PL, 2014, IEEE INT CONF BIG DA, P471, DOI 10.1109/BigData.2014.7004264
[7]  
Chen YK, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P3544
[8]   Online Density Bursting Subgraph Detection from Temporal Graphs [J].
Chu, Lingyang ;
Zhang, Yanyan ;
Yang, Yu ;
Wang, Lanjun ;
Pei, Jian .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2019, 12 (13) :2353-2365
[9]  
Dong Z, 2021, Arxiv, DOI arXiv:2105.08628
[10]   A survey of community search over big graphs [J].
Fang, Yixiang ;
Huang, Xin ;
Qin, Lu ;
Zhang, Ying ;
Zhang, Wenjie ;
Cheng, Reynold ;
Lin, Xuemin .
VLDB JOURNAL, 2020, 29 (01) :353-392