Behavior-based clustering and analysis of interestingness measures for association rule mining

被引:0
作者
C. Tew
C. Giraud-Carrier
K. Tanner
S. Burton
机构
[1] Brigham Young University,Department of Computer Science
来源
Data Mining and Knowledge Discovery | 2014年 / 28卷
关键词
Interestingness measures; Clustering; Behavior analysis; Association rule mining;
D O I
暂无
中图分类号
学科分类号
摘要
A number of studies, theoretical, empirical, or both, have been conducted to provide insight into the properties and behavior of interestingness measures for association rule mining. While each has value in its own right, most are either limited in scope or, more importantly, ignore the purpose for which interestingness measures are intended, namely the ultimate ranking of discovered association rules. This paper, therefore, focuses on an analysis of the rule-ranking behavior of 61 well-known interestingness measures tested on the rules generated from 110 different datasets. By clustering based on ranking behavior, we highlight, and formally prove, previously unreported equivalences among interestingness measures. We also show that there appear to be distinct clusters of interestingness measures, but that there remain differences among clusters, confirming that domain knowledge is essential to the selection of an appropriate interestingness measure for a particular task and business objective.
引用
收藏
页码:1004 / 1045
页数:41
相关论文
共 85 条
[1]  
Agrawal R(1993)Mining association rules between sets of items in large databases ACM SIGMOD Rec 22 207-216
[2]  
Imieliński T(2006)Loevinger’s measures of rule quality for assessing cluster stability Comput Stat Data Anal 50 992-1015
[3]  
Swami A(2002)Measuring the accuracy and interest of association rules: a new framework Intell Data Anal 6 221-235
[4]  
Bertrand P(1968)The amount of information that y gives about x IEEE Trans Inf Theory 14 27-31
[5]  
Bel Mufti G(1960)A coefficient of agreement for nominal scales Educ Psychol Meas 20 37-46
[6]  
Berzal F(1998)Averaging correlations: expected values and bias in combined Pearson J Gen Psychol 125 245-261
[7]  
Blanco I(2010)s and Fisher’s SIGKDD Explor 12 92-100
[8]  
Sánchez D(1957) transformations I. Biometrika 44 470-481
[9]  
Vila MA(2005)A framework for mining interesting pattern sets Mach Learn 58 39-77
[10]  
Blachman N(2006)Test for rank correlation coefficients ACM Comput Surv 38 1-32