Infrequent Weighted Itemset Mining Using Frequent Pattern Growth

被引:44
作者
Cagliero, Luca [1 ]
Garza, Paolo [1 ]
机构
[1] Politecn Torino, Dipartimento Automat & Informat, I-10129 Turin, Italy
关键词
Clustering; classification; and association rules; data mining; ALGORITHM;
D O I
10.1109/TKDE.2013.69
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent weighted itemsets represent correlations frequently holding in data in which items may weight differently. However, in some contexts, e.g., when the need is to minimize a certain cost function, discovering rare data correlations is more interesting than mining frequent ones. This paper tackles the issue of discovering rare and weighted itemsets, i.e., the infrequent weighted itemset (IWI) mining problem. Two novel quality measures are proposed to drive the IWI mining process. Furthermore, two algorithms that perform IWI and Minimal IWI mining efficiently, driven by the proposed measures, are presented. Experimental results show efficiency and effectiveness of the proposed approach.
引用
收藏
页码:903 / 915
页数:13
相关论文
共 20 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], 2003, P 9 ACM SIGKDD INT C
[3]  
[Anonymous], IBM QUEST SYNTH DAT
[4]  
Antonie M.-L., 2001, P 2 INT WORKSH MULT
[5]  
Bernecker T, 2009, KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, P119
[6]  
Chui CK, 2007, LECT NOTES COMPUT SC, V4426, P47
[7]  
CONG G, 2004, P ACM SIGMOD INT C M
[8]  
Dong X., 2007, 2 INT C INNOVATIVE C, P528
[9]   A computational algorithm for handling the special uniques problem [J].
Elliot, MJ ;
Manning, AM ;
Ford, RW .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2002, 10 (05) :493-509
[10]  
Frank A., 2010, UCI Machine Learning Repository