A novel algorithm for frequent itemset mining in data warehouses

被引:3
|
作者
徐利军
谢康林
机构
[1] Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai 200030 China
[2] Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai 200030 China
关键词
Frequent itemset; Close itemset; Star schema; Dimension table; Fact table;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
1201 ;
摘要
Current technology for frequent itemset mining mostly applies to the data stored in a single transaction database. This paper presents a novel algorithm MultiClose for frequent itemset mining in data warehouses. MultiClose respectively computes the results in single dimension tables and merges the results with a very efficient approach. Close itemsets technique is used to improve the performance of the algorithm. The authors propose an efficient implementation for star schemas in which their al- gorithm outperforms state-of-the-art single-table algorithms.
引用
收藏
页码:216 / 224
页数:9
相关论文
共 50 条
  • [21] A Generalized Parallel Algorithm for Frequent Itemset Mining
    Craus, Mitica
    Archip, Alexandru
    PROCEEDINGS OF THE 12TH WSEAS INTERNATIONAL CONFERENCE ON COMPUTERS , PTS 1-3: NEW ASPECTS OF COMPUTERS, 2008, : 520 - +
  • [22] An Incremental Algorithm for Frequent Itemset Mining on Spark
    Yu, Min
    Zuo, Chuang
    Yuan, Yunpeng
    Yang, Yulu
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 281 - 285
  • [23] Parallel Frequent Itemset Mining on Streaming Data
    He, Yanshan
    Yue, Min
    2014 10TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION (ICNC), 2014, : 725 - 730
  • [24] Frequent Itemset Mining for Big Data in social media using ClustBigFIM algorithm
    Gole, Sheela
    Tidke, Bharat
    2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [25] A distributed frequent itemset mining algorithm using Spark for Big Data analytics
    Feng Zhang
    Min Liu
    Feng Gui
    Weiming Shen
    Abdallah Shami
    Yunlong Ma
    Cluster Computing, 2015, 18 : 1493 - 1501
  • [26] A distributed frequent itemset mining algorithm using Spark for Big Data analytics
    Zhang, Feng
    Liu, Min
    Gui, Feng
    Shen, Weiming
    Shami, Abdallah
    Ma, Yunlong
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2015, 18 (04): : 1493 - 1501
  • [27] A Frequent Itemset Reduction Algorithm for Global Pattern Mining on Distributed Data Streams
    Shalini
    Jain, Sanjay Kumar
    2017 TENTH INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING (IC3), 2017, : 205 - 210
  • [28] Efficient Skyline Frequent-Utility Itemset Mining Algorithm on Massive Data
    He, Jingxuan
    Han, Xixian
    Wan, Xiaolong
    Wang, Jinbao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (07) : 3009 - 3023
  • [29] Fast Mining Algorithm of Frequent Itemset Based on Spark
    Ding J.-M.
    Li H.-B.
    Deng B.
    Jia L.-Y.
    You J.-G.
    Ruan Jian Xue Bao/Journal of Software, 2023, 34 (05): : 2446 - 2464
  • [30] Frequent Itemset Mining Algorithm based on Sampling Method
    Li, Haifeng
    Zhang, Ning
    Zhang, Yuejin
    PROCEEDINGS OF THE 2015 5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCES AND AUTOMATION ENGINEERING, 2016, 42 : 852 - 855