Frequent pattern mining on message passing multiprocessor systems

被引:61
|
作者
Javed, A [1 ]
Khokhar, A [1 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
frequent pattern mining; parallel processing; association rule; data mining;
D O I
10.1023/B:DAPD.0000031634.19130.bd
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of frequent patterns in transaction-oriented database is crucial to several data mining tasks such as association rule generation, time series analysis, classification, etc. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the case, scalable high performance solutions involving multiple processors are required. This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
引用
收藏
页码:321 / 334
页数:14
相关论文
共 50 条
  • [31] Survey of differential privacy in frequent pattern mining
    Ding, Li-Ping
    Lu, Guo-Qing
    Tongxin Xuebao/Journal on Communications, 2014, 35 (10): : 200 - 209
  • [32] Frequent Pattern Mining for Kernel Trace Data
    LaRosa, Christopher
    Xiong, Li
    Mandelberg, Ken
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 880 - 885
  • [33] Frequent Pattern Mining for Online Handwriting Recognition
    Gmati, Chekib
    Sliti, Oumaima
    Hamam, Habib
    Lachiri, Zied
    2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
  • [34] Extending Task Parallelism For Frequent Pattern Mining
    Kambadur, Prabhanjan
    Ghoting, Amol
    Gupta, Anshul
    Lumsdaine, Andrew
    PARALLEL COMPUTING: FROM MULTICORES AND GPU'S TO PETASCALE, 2010, 19 : 289 - 296
  • [35] Frequent Pattern Mining in Big Social Graphs
    Li, Lei
    Ding, Ping
    Chen, Huanhuan
    Wu, Xindong
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (03): : 638 - 648
  • [36] Improved Pattern Tree for Incremental Frequent-Pattern Mining
    周明
    王太勇
    Transactions of Tianjin University , 2010, (02) : 129 - 134
  • [37] Frequent pattern mining as a clique extracting task
    Kuusik, R
    Lind, G
    Vohandu, L
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: INFORMATION SYSTEMS, TECHNOLOGIES AND APPLICATIONS: I, 2004, : 425 - 428
  • [38] Mining Condensed Frequent-Pattern Bases
    Jian Pei
    Guozhu Dong
    Wei Zou
    Jiawei Han
    Knowledge and Information Systems, 2004, 6 : 570 - 594
  • [39] Tightening upper bounds to the expected support for uncertain frequent pattern mining
    Leung, Carson K.
    MacKinnon, Richard Kyle
    Tanbeer, Syed K.
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 328 - 337
  • [40] Community detection in social networks using user frequent pattern mining
    Moosavi, Seyed Ahmad
    Jalali, Mehrdad
    Misaghian, Negin
    Shamshirband, Shahaboddin
    Anisi, Mohammad Hossein
    KNOWLEDGE AND INFORMATION SYSTEMS, 2017, 51 (01) : 159 - 186