Efficient Temporal Butterfly Counting and Enumeration on Temporal Bipartite Graphs

被引:4
作者
Cai, Xinwei [1 ]
Ke, Xiangyu [1 ]
Wang, Kai [2 ]
Chen, Lu [1 ]
Zhang, Tianming [1 ]
Liu, Qing [1 ]
Gao, Yunjun [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
[2] Shanghai Jiao Tong Univ, ACEM, Shanghai, Peoples R China
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2023年 / 17卷 / 04期
关键词
NETWORK MOTIFS; ALGORITHMS;
D O I
10.14778/3636218.3636223
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bipartite graphs characterize relationships between two different sets of entities, like actor-movie, user-item, and author-paper. The butterfly, a 4-vertices 4-edges (2,2)-biclique, is the simplest cohesive motif in a bipartite graph and is the fundamental component of higher-order substructures. Counting and enumerating the butterflies offer significant benefits across various applications, including fraud detection, graph embedding, and community search. While the corresponding motif, the triangle, in the unipartite graphs has been widely studied in both static and temporal settings, the extension of butterfly to temporal bipartite graphs remains unexplored. In this paper, we investigate the temporal butterfly counting and enumeration problem: count and enumerate the butterflies whose edges establish following a certain order within a given duration. Towards efficient computation, we devise a non-trivial baseline rooted in the state-of-the-art butterfly counting algorithm on static graphs, further, explore the intrinsic property of the temporal butterfly, and develop a new optimization framework with a compact data structure and effective priority strategy. The time complexity is proved to be significantly reduced without compromising on space efficiency. In addition, we generalize our algorithms to practical streaming settings and multi-core computing architectures. Our extensive experiments on 11 large-scale real-world datasets demonstrate the efficiency and scalability of our solutions.
引用
收藏
页码:657 / 670
页数:14
相关论文
共 65 条
  • [1] Graphlet decomposition: framework, algorithms, and applications
    Ahmed, Nesreen K.
    Neville, Jennifer
    Rossi, Ryan A.
    Duffield, Nick G.
    Willke, Theodore L.
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 50 (03) : 689 - 722
  • [2] Aksoy SG, 2017, J COMPLEX NETW, V5, P581, DOI 10.1093/comnet/cnx001
  • [3] Boekhout H. D., 2019, Comput. Social Netw., V6, P1
  • [4] Counting Graphlets: Space vs Time
    Bressan, Marco
    Chierichetti, Flavio
    Kumar, Ravi
    Leucci, Stefano
    Panconesi, Alessandro
    [J]. WSDM'17: PROCEEDINGS OF THE TENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2017, : 557 - 566
  • [5] Cai XW, 2024, Arxiv, DOI arXiv:2306.00893
  • [6] Efficiently Answering Reachability and Path Queries on Temporal Bipartite Graphs
    Chen, Xiaoshuang
    Wang, Kai
    Lin, Xuemin
    Zhang, Wenjie
    Qin, Lu
    Zhang, Ying
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2021, 14 (10): : 1845 - 1858
  • [7] Power-Law Distributions in Empirical Data
    Clauset, Aaron
    Shalizi, Cosma Rohilla
    Newman, M. E. J.
    [J]. SIAM REVIEW, 2009, 51 (04) : 661 - 703
  • [8] Cormen Thomas H, 2022, algorithms
  • [9] Modelling disease outbreaks in realistic urban social networks
    Eubank, S
    Guclu, H
    Kumar, VSA
    Marathe, MV
    Srinivasan, A
    Toroczkai, Z
    Wang, N
    [J]. NATURE, 2004, 429 (6988) : 180 - 184
  • [10] Scalable Motif Counting for Large-scale Temporal Graphs
    Gao, Zhongqiang
    Cheng, Chuanqi
    Yu, Yanwei
    Cao, Lei
    Huang, Chao
    Dong, Junyu
    [J]. 2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 2656 - 2668