A Fast Parallel Algorithm for Discovering Frequent Patterns

Cited by: 7
Authors
Lin, Kawuu W. [1 ]
Luo, Yu-Chin [1 ]
Affiliations
[1] Natl Kaohsiung Univ Appl Sci, Dept Comp Sci & Informat Engn, Kaohsiung 807, Taiwan
Source
2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC 2009) | 2009
Keywords
Data mining; cloud computing; association rule mining; frequent pattern mining; privacy preserved;
DOI
10.1109/GRC.2009.5255089
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Fast discovery of frequent patterns is the most extensively discussed problem in the data mining field because of its wide range of applications. As the size of a database increases, the computation time and the required memory increase sharply. The difficulty of mining large databases has motivated research on parallel and distributed algorithms to solve the problem. Most past studies tried to parallelize the computation by dividing the database and distributing the partitions to other nodes for mining. This approach may leak data and is evidently unsuitable for sensitive domains such as health care. In this paper, we propose a novel data mining algorithm named FD-Mine that efficiently utilizes the nodes to discover frequent patterns in cloud computing environments while preserving data privacy. In empirical evaluations under various simulation conditions, the proposed FD-Mine delivers excellent performance in terms of scalability and execution time.
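For orientation, the database-partitioning approach that the abstract attributes to past studies can be sketched roughly as follows. This is not FD-Mine, whose details are not given in this record; it is a minimal illustration of frequent itemset counting over a divided database. The function names, the `max_size` cap, and the toy transactions are assumptions made for the example. Each partition is counted independently (in a real deployment, on a separate node) and the counts are merged afterwards, which is why the raw transactions would have to leave the data owner.

```python
from collections import Counter
from itertools import combinations


def count_partition(transactions, max_size):
    """Count every itemset of up to max_size items within one partition."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # canonical order, no repeated items
        for size in range(1, max_size + 1):
            for itemset in combinations(items, size):
                counts[itemset] += 1
    return counts


def frequent_itemsets(partitions, min_support, max_size=2):
    """Merge per-partition counts and keep itemsets that meet min_support."""
    total = Counter()
    for partition in partitions:  # each call could run on a different node
        total.update(count_partition(partition, max_size))
    return {itemset: n for itemset, n in total.items() if n >= min_support}


# Hypothetical toy database, already divided into two partitions.
partitions = [
    [["bread", "milk"], ["bread", "butter"]],
    [["bread", "milk", "butter"], ["milk"]],
]
result = frequent_itemsets(partitions, min_support=2)
```

With these toy transactions, `result` keeps the three single items and the pairs `("bread", "butter")` and `("bread", "milk")`, while `("butter", "milk")` is dropped because it occurs only once. Enumerating all itemsets up to a fixed size is only practical for small examples; real miners prune candidates instead.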
Pages: 398 - 403
Number of pages: 6
Related papers
50 in total
  • [1] A fast and resource efficient mining algorithm for discovering frequent patterns in distributed computing environments
    Lin, Kawuu W.
    Chung, Sheng-Hao
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2015, 52 : 49 - 58
  • [2] Fast Discovering Frequent Patterns for Incremental XML Queries
    Peng, Dun-lu
    Wuhan University Journal of Natural Sciences, 2004, (05) : 638 - 646
  • [3] Parallel Frequent Patterns Mining Algorithm on GPU
    Zhou, Jiayi
    Yu, Kun-Ming
    Wu, Bin-Chang
    2010 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2010), 2010,
  • [4] A fast and distributed algorithm for mining frequent patterns in congested networks
    Kawuu W. Lin
    Sheng-Hao Chung
    Chun-Cheng Lin
    Computing, 2016, 98 : 235 - 256
  • [5] A fast and distributed algorithm for mining frequent patterns in congested networks
    Lin, Kawuu W.
    Chung, Sheng-Hao
    Lin, Chun-Cheng
    COMPUTING, 2016, 98 (03) : 235 - 256
  • [6] A novel parallel algorithm for frequent pattern mining with privacy preserved in cloud computing environments
    Lin, Kawuu W.
    Deng, Der-Jiunn
    INTERNATIONAL JOURNAL OF AD HOC AND UBIQUITOUS COMPUTING, 2010, 6 (04) : 205 - 215
  • [7] Scalable parallel algorithm for mining frequent patterns on message passing multiprocessor systems
    Javed, A
    Khokhar, A
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS, 2003, : 157 - 162
  • [8] Discovering exclusive patterns in frequent sequences
    Chen, Weiru
    Lu, Jing
    Keech, Malcolm
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2010, 2 (03) : 252 - 267
  • [9] Fast algorithm to discovering sequential patterns from large databases
    Hu Huirong
    PROCEEDINGS OF THE 24TH CHINESE CONTROL CONFERENCE, VOLS 1 AND 2, 2005, : 1352 - 1355
  • [10] And code algorithm for discovering frequent itemsets
    Zhou, HY
    Zhang, Y
    THIRD INTERNATIONAL CONFERENCE ON ELECTRONIC COMMERCE ENGINEERING: DIGITAL ENTERPRISES AND NONTRADITIONAL INDUSTRIALIZATION, 2003, : 569 - 572