Detecting and Analyzing Motifs in Large-Scale Online Transaction Networks

被引:0
作者
Jiang, Jiawei [1 ]
Huang, Hao [1 ]
Zheng, Zhigao [1 ]
Wei, Yi [1 ]
Fu, Fangcheng [2 ]
Li, Xiaosen [3 ]
Cui, Bin [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Peking Univ, Sch Comp Sci, Beijing 100871, Peoples R China
[3] Tencen Inc, Shenzhen 518054, Peoples R China
基金
中国国家自然科学基金;
关键词
Image edge detection; Network motifs; Social networking (online); Biology; Servers; Tagging; Proteins; Indexes; Feature extraction; Detection algorithms; Online transaction network; network motif; parameter server; DISCOVERY; MAPREDUCE;
D O I
10.1109/TKDE.2024.3511136
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Motif detection is a graph algorithm that detects certain local structures in a graph. Although network motif has been studied in graph analytics, e.g., social network and biological network, it is yet unclear whether network motif is useful for analyzing online transaction network that is generated in applications such as instant messaging and e-commerce. In an online transaction network, each vertex represents a user's account and each edge represents a money transaction between two users. In this work, we try to analyze online transaction networks with network motifs. We design motif-based vertex embedding that integrates motif counts and centrality measurements. Furthermore, we design a distributed framework to detect motifs in large-scale online transaction networks. Our framework obtains the edge directions using a bi-directional tagging method and avoids redundant detection with a reduced view of neighboring vertices. We implement the proposed framework under the parameter server architecture. In the evaluation, we analyze different kinds of online transaction networks w.r.t the distribution of motifs and evaluate the effectiveness of motif-based embedding in downstream graph analytical tasks. The experimental results also show that our proposed motif detection framework can efficiently handle large-scale graphs.
引用
收藏
页码:584 / 596
页数:13
相关论文
共 59 条
  • [1] [Anonymous], 2008, P ACM SIGKDD INT C K
  • [2] [Anonymous], 2021, Twitter
  • [3] PATRIC: A Parallel Algorithm for Counting Triangles in Massive Networks
    Arifuzzaman, Shaikh
    Khan, Maleq
    Marathe, Madhav
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 529 - 538
  • [4] Bar-Yossef Z, 2002, SIAM PROC S, P623
  • [5] Simple trophic modules for complex food webs
    Bascompte, J
    Melián, CJ
    [J]. ECOLOGY, 2005, 86 (11) : 2868 - 2873
  • [6] Chen J., 2006, P ACM SIGKDD INT C K, P106, DOI DOI 10.1145/1150402.1150418
  • [7] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [8] Comparison of co-authorship networks across scientific fields using motifs
    Choobdar, Sarvenaz
    Ribeiro, Pedro
    Bugla, Sylwia
    Silva, Fernando
    [J]. 2012 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2012, : 147 - 152
  • [9] Coppersmith D., 1987, P ANN ACM S THEOR CO, P1
  • [10] Mapreduce: Simplified data processing on large clusters
    Dean, Jeffrey
    Ghemawat, Sanjay
    [J]. COMMUNICATIONS OF THE ACM, 2008, 51 (01) : 107 - 113