Parallel Graph Mining with Dynamic Load Balancing

被引:0
作者
Talukder, Nilothpal [1 ]
Zaki, Mohammed J. [2 ]
机构
[1] IBM Syst, Poughkeepsie, NY 12601 USA
[2] Rensselaer Polytech Inst, Troy, NY USA
来源
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2016年
关键词
Parallel Frequent Graph Mining; Dynamic Load Balancing; High Performance Computing; FREQUENT SUBGRAPH;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Frequent subgraph mining (FSM) has important applications in areas such as bioinformatics, social networks and others. In this paper, we present a highly scalable approach called PARGRAPH that can efficiently mine from a single graph in both distributed as well as shared-memory based systems. In a distributed environment, we can leverage the local memory of multiple compute nodes for storing a large number of intermediate states for enumerating patterns. To address the skewness in the pattern generation tree, our approach uses a novel hybrid load balancing scheme to efficiently distribute workload across both processes and threads. Our experiments demonstrate good speedups using message passing interface (MPI) and OpenMP threads.
引用
收藏
页码:3352 / 3359
页数:8
相关论文
共 26 条
  • [1] Agrawal R., P 20 INT C VERY LARG
  • [2] An Iterative MapReduce Based Frequent Subgraph Mining Algorithm
    Bhuiyan, Mansurul A.
    Al Hasan, Mohammad
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (03) : 608 - 620
  • [3] Bringmann B, 2008, LECT NOTES ARTIF INT, V5012, P858, DOI 10.1007/978-3-540-68125-0_84
  • [4] Buehrer G., 2006, P 6 IEEE INT C DAT M
  • [5] Substructure Discovery Using Minimum Description Length and Background Knowledge
    Cook, Diane J.
    Holder, Lawrence B.
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1993, 1 : 231 - 255
  • [6] Dynamic load balancing for the distributed mining of molecular structures
    Di Fatta, Giuseppe
    Berthold, Michael R.
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2006, 17 (08) : 773 - 785
  • [7] DERIVATION OF A TERMINATION DETECTION ALGORITHM FOR DISTRIBUTED COMPUTATIONS
    DIJKSTRA, EW
    FEIJEN, WHJ
    VANGASTEREN, AJM
    [J]. INFORMATION PROCESSING LETTERS, 1983, 16 (05) : 217 - 219
  • [8] GRAMI: Frequent Subgraph and Pattern Mining in a Single Large Graph
    Elseidy, Mohammed
    Abdelhamid, Ehab
    Skiadopoulos, Spiros
    Kalnis, Panos
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 7 (07): : 517 - 528
  • [9] Huan J, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P549
  • [10] Huan J., 2004, PROC 10 ACM SIGKDD I, P581, DOI [DOI 10.1145/1014052.1014123, 10.1145/1014052.1014123]