An efficient approach for mining cross-level closed itemsets and minimal association rules using closed itemset lattices

Cited by: 25
Authors
Hashem, Tahrima [1]
Ahmed, Chowdhury Farhan [1]
Samiullah, Md [1]
Akther, Sayma [1]
Jeong, Byeong-Soo [2]
Jeon, Seokhee [1]
Affiliations
[1] Univ Dhaka, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] Kyung Hee Univ, Dept Comp Engn, Seoul, South Korea
Funding
National Research Foundation, Singapore;
Keywords
Data mining; Association rules; Frequent itemset; Closed itemset; Minimal rules; Closed itemset lattice; PATTERNS;
DOI
10.1016/j.eswa.2013.09.052
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Multilevel knowledge in transactional databases plays a significant role in real-life market basket analysis. Many researchers have studied the mining of hierarchical association rules and have proposed various approaches. However, some existing approaches produce many multilevel and cross-level association rules that fail to convey quality information. From such a large number of redundant association rules, it is extremely difficult to extract any meaningful information. Some approaches do mine minimal association rules, but they have many shortcomings because they rely on naive techniques. In this paper, we focus on the need to generate hierarchical minimal rules that provide maximal information. We propose an algorithm to derive minimal multilevel and cross-level association rules. Our work makes significant contributions to mining minimal cross-level association rules, which express the mixed relationship between the generalized and specialized views of the transaction itemsets. We are the first to design an efficient algorithm using a closed itemset lattice-based approach, which can mine the most relevant minimal cross-level association rules. The parent-child relationship of the lattices is exploited while mining cross-level closed itemset lattices. We have extensively evaluated the efficiency of the proposed algorithm through a large number of experiments on a variety of real-life datasets. In this extensive performance comparison, the proposed algorithm significantly outperformed the existing related work. (C) 2013 Elsevier Ltd. All rights reserved.
Pages: 2914 - 2938
Page count: 25