Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems

Cited by: 0
Authors
Javed, A [1]
Khokhar, A [1]
Affiliations
[1] Univ Illinois, Dept CS, Chicago, IL 60612 USA
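Source
PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS | 2003
Keywords
frequent pattern mining; parallel processing; association rule; data mining
DOI
Not available
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract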
This paper presents an efficient, scalable parallel algorithm for mining frequent patterns on parallel shared-nothing platforms. The proposed algorithm is based on one of the best-known sequential techniques, the Frequent Pattern (FP) Growth algorithm. Unlike most earlier parallel approaches, which are based on variants of the Apriori algorithm, the algorithm presented in this paper does not explicitly duplicate the entire counting data structure on each processor. Furthermore, the proposed algorithm introduces minimal communication (and hence synchronization) overhead by efficiently partitioning the list of frequent elements across processors. The experimental results show scalable performance over different machine and problem sizes. A comparison of implementation results with existing parallel approaches shows significant gains in speedup. On an 8-processor machine, we report an average speedup of 6 for different problem sizes.
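The abstract does not spell out how the list of frequent elements is partitioned, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a greedy load-balancing rule (each item goes to the processor with the smallest accumulated support count), and the function names `frequent_items` and `partition_items`, the toy transactions, and the `min_support` threshold are all hypothetical.

```python
from collections import Counter


def frequent_items(transactions, min_support):
    """Return (item, count) pairs meeting min_support, most frequent first."""
    # Count each item at most once per transaction.
    counts = Counter(item for t in transactions for item in set(t))
    frequent = [(item, c) for item, c in counts.items() if c >= min_support]
    # Sort by descending count, then by item name for a deterministic order.
    return sorted(frequent, key=lambda pair: (-pair[1], pair[0]))


def partition_items(frequent, num_procs):
    """Assumed rule: give each item to the currently least-loaded processor."""
    parts = [[] for _ in range(num_procs)]
    loads = [0] * num_procs
    for item, count in frequent:
        target = loads.index(min(loads))  # first processor with minimum load
        parts[target].append(item)
        loads[target] += count
    return parts


# Toy data (hypothetical), split across two simulated processors.
transactions = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c", "d"],
    ["b", "c"],
    ["a", "b", "c", "e"],
]
frequent = frequent_items(transactions, min_support=2)
parts = partition_items(frequent, num_procs=2)
print(frequent)
print(parts)
```

In a shared-nothing setting, each processor would then mine only the patterns associated with its own share of the items, which is one way to avoid replicating the full counting structure everywhere. For reference, the reported average speedup of 6 on 8 processors corresponds to a parallel efficiency of 6/8 = 0.75.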
Pages: 157-162
Number of pages: 6