Frequent pattern mining on message passing multiprocessor systems

被引:61
|
作者
Javed, A [1 ]
Khokhar, A [1 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
基金
美国国家科学基金会;
关键词
frequent pattern mining; parallel processing; association rule; data mining;
D O I
10.1023/B:DAPD.0000031634.19130.bd
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Extraction of frequent patterns in transaction-oriented database is crucial to several data mining tasks such as association rule generation, time series analysis, classification, etc. Most of these mining tasks require multiple passes over the database and if the database size is large, which is usually the case, scalable high performance solutions involving multiple processors are required. This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
引用
收藏
页码:321 / 334
页数:14
相关论文
共 50 条
  • [1] Frequent Pattern Mining on Message Passing Multiprocessor Systems
    Asif Javed
    Ashfaq Khokhar
    Distributed and Parallel Databases, 2004, 16 : 321 - 334
  • [2] Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems
    Javed, A
    Khokhar, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2003, : 157 - 162
  • [3] Reframing in Frequent Pattern Mining
    Ahmed, Chowdhury Farhan
    Samiullah, Md.
    Lachiche, Nicolas
    Kull, Meelis
    Flach, Peter
    2015 IEEE 27TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2015), 2015, : 799 - 806
  • [4] Hierarchical Level Based Frequent Pattern Mining
    Hwang, Jeong Hee
    Kim, Hyeok
    Chi, Jeong Hee
    INTERNATIONAL JOURNAL OF GRID AND DISTRIBUTED COMPUTING, 2018, 11 (09): : 1 - 12
  • [5] COMPARATIVE STUDY OF FREQUENT PATTERN MINING TECHNIQUES
    Singh, Gauravjeet
    Bal, Sandeep
    Kaur, Poonamjeet
    Kaur, Kanwaljit
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1022 - 1027
  • [6] Efficient Adaptive Frequent Pattern Mining Techniques for Market Analysis in Sequential and Parallel Systems
    Kuriakose, Sherly
    Nedunchezhian, Raju
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (02) : 175 - 185
  • [7] Design and Analysis of a Reconfigurable Platform for Frequent Pattern Mining
    Sun, Song
    Zambreno, Joseph
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2011, 22 (09) : 1497 - 1505
  • [8] An efficient mining algorithm for frequent pattern in intrusion detection
    Li, QH
    Xiong, JJ
    Yang, HB
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 138 - 142
  • [9] Frequent Pattern Mining in Mobile Devices (A Feasibility Study)
    Rehman, Muhammad Habib Ur
    Liew, Chee Sun
    Teh, Ying Wah
    PROCEEDINGS OF THE 2014 6TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MULTIMEDIA (ICIM), 2014, : 351 - 356
  • [10] Frequent pattern mining-based sales forecasting
    Murlidharan, Vijayalakshmi
    Menezes, Bernard
    OPSEARCH, 2013, 50 (04) : 455 - 474