Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems

被引:0
|
作者
Javed, A [1 ]
Khokhar, A [1 ]
机构
[1] Univ Illinois, Dept CS, Chicago, IL 60612 USA
来源
PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS | 2003年
关键词
frequent pattern mining; parallel processing; association rule; data mining;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an efficient scalable parallel algorithm for mining frequent patterns on parallel shared nothing platforms. The proposed algorithm is based on one of the best known sequential techniques referred to as Frequent Pattern (FP) Growth algorithm. Unlike most of the earlier parallel approaches based on different variants of the Apriori Algorithm, the algorithm presented in this paper does not explicitly result in having entire counting data structure duplicated on each processor. Furthermore, the proposed algorithm introduces minimum communication (and hence synchronization) overheads by efficiently partitioning the list of frequent elements list over processors. The experimental results show scalable performance over different machine and problem sizes. The comparison of implementation results with existing parallel approaches show significant gains in the speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
引用
收藏
页码:157 / 162
页数:6
相关论文
共 50 条
  • [31] Data token heuristic scheduling of the Kalman algorithm onto a message-passing multiprocessor system
    Vaidehi, V
    Krishnan, CN
    CONTROL ENGINEERING PRACTICE, 1997, 5 (12) : 1691 - 1703
  • [32] An Efficient Parallel Method for Mining Frequent Closed Sequential Patterns
    Bao Huynh
    Bay Vo
    Snasel, Vaclav
    IEEE ACCESS, 2017, 5 : 17392 - 17402
  • [33] OPTIMIZATION AND REALIZATION OF PARALLEL FREQUENT ITEM SET MINING ALGORITHM
    Yuan, Ling
    Li, Dan
    Chen, Yuzhong
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 546 - 551
  • [34] Parallel frequent itemsets mining algorithm without intermediate result
    Lan, YJ
    Qiu, Y
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 2102 - 2107
  • [35] Load balancing approach parallel algorithm for frequent pattern mining
    Yu, Kun-Ming
    Zhou, Jiayi
    Hsia, Wei Chen
    PARALLEL COMPUTING TECHNOLOGIES, PROCEEDINGS, 2007, 4671 : 623 - +
  • [36] A novel parallel frequent itemset mining algorithm for automatic enterprise
    Mao, Yimin
    Wu, Bin
    Deng, Qianhu
    Mahmoodi, Soroosh
    Chen, Zhigang
    Chen, Yeh-Cheng
    ENTERPRISE INFORMATION SYSTEMS, 2023, 17 (10)
  • [37] Improved algorithm for mining maximum frequent patterns based on FP-Tree
    Liu, Naili
    Ma, Lei
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION APPLICATIONS (ICCIA 2012), 2012, : 833 - 836
  • [38] A NOVEL ALGORITHM FOR FAST MINING FREQUENT PATTERNS BASED ON SUPPORT LIST STRUCTURE
    Zhu, Xiaolin
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2022, 23 (09) : 1943 - 1966
  • [39] A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments
    Lin, Kawuu W.
    Deng, Der-Jiunn
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2010, 6 (04) : 205 - 215
  • [40] An efficient algorithm for mining frequent inter-transaction patterns
    Lee, Anthony J. T.
    Wang, Chun-Sheng
    INFORMATION SCIENCES, 2007, 177 (17) : 3453 - 3476