A time-efficient breadth-first level-wise lattice-traversal algorithm to discover rare itemsets

被引:20
作者
Troiano, Luigi [1 ]
Scibelli, Giacomo [1 ]
机构
[1] Univ Sannio, Dept Engn, I-81031 Benevento, Italy
关键词
Data mining methods and algorithms; Rare itemsets; RULES;
D O I
10.1007/s10618-013-0304-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we face the problem of searching for rare itemsets. A main issue regards the strategy to adopt in exploring the power set lattice. Assuming a power set lattice with full set at the top and empty set at the bottom, the most of the algorithms adopt a bottom-up exploration, i.e. moving from smaller to larger sets. Although this approach is advantageous in the case of frequent itemsets, it might not be worth being used for rare itemsets, as they occur on the top of the lattice. We propose Rarity, a top-down breadth-first level-wise algorithm. Experimental results and comparisons are illustrated in order to provide a quantitative characterization of algorithm performances and complexity. Application to some UCI benchmark and real world datasets is provided. An algorithm parallelization is outlined. Experiments showed that this approach takes advantage of finding all rare non-zero itemsets in less time than other solutions, at expenses of higher memory demand.
引用
收藏
页码:773 / 807
页数:35
相关论文
共 37 条
[1]   Rare itemset mining [J].
Adda, Mehdi ;
Wu, Lei ;
Feng, Yi .
ICMLA 2007: SIXTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS, PROCEEDINGS, 2007, :73-+
[2]   Parallel mining of association rules [J].
Agrawal, R ;
Shafer, JC .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) :962-969
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]   DATABASE MINING - A PERFORMANCE PERSPECTIVE [J].
AGRAWAL, R ;
IMIELINSKI, T ;
SWAMI, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) :914-925
[5]  
AGRAWAL R, 1994, 20 VLDB C
[6]  
Agrawal R., 1996, ADV KNOWLEDGE DISCOV
[7]  
[Anonymous], 1991, KNOWLEDGE DISCOVERY
[8]  
[Anonymous], 1994, KDD
[9]  
[Anonymous], ACT 15 JOURN BAS DON
[10]  
[Anonymous], P 5 INT C CONC LATT