Behavior-based clustering and analysis of interestingness measures for association rule mining

Cited by: 50
Authors
Tew, C. [1]
Giraud-Carrier, C. [1]
Tanner, K. [1]
Burton, S. [1]
Affiliations
[1] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA
Keywords
Interestingness measures; Clustering; Behavior analysis; Association rule mining; CLASSIFICATION; ATTRIBUTES; INFORMATION; TESTS
DOI
10.1007/s10618-013-0326-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
A number of studies, theoretical, empirical, or both, have been conducted to provide insight into the properties and behavior of interestingness measures for association rule mining. While each has value in its own right, most are either limited in scope or, more importantly, ignore the purpose for which interestingness measures are intended, namely the ultimate ranking of discovered association rules. This paper, therefore, focuses on an analysis of the rule-ranking behavior of 61 well-known interestingness measures tested on the rules generated from 110 different datasets. By clustering based on ranking behavior, we highlight, and formally prove, previously unreported equivalences among interestingness measures. We also show that there appear to be distinct clusters of interestingness measures, but that there remain differences among clusters, confirming that domain knowledge is essential to the selection of an appropriate interestingness measure for a particular task and business objective.
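To make the idea of comparing measures by their rule-ranking behavior concrete, here is a minimal Python sketch. It is not the authors' procedure: the five rules (given as contingency counts), the choice of three measures (confidence, Ganascia, lift), and all function names are assumptions made for illustration, and the sketch assumes no tied scores. It ranks the same rules under each measure, compares the rankings with Spearman's rank correlation, and groups measures that produce identical rankings.

```python
from itertools import combinations

# Hypothetical rules A -> B, each given as counts (n, n_a, n_b, n_ab):
# n transactions, n_a containing A, n_b containing B, n_ab containing both.
RULES = [
    (100, 40, 50, 30),
    (100, 20, 80, 18),
    (100, 60, 30, 24),
    (100, 10, 10, 6),
    (100, 50, 50, 15),
]

def confidence(n, n_a, n_b, n_ab):
    return n_ab / n_a

def ganascia(n, n_a, n_b, n_ab):
    # 2 * confidence - 1: an increasing function of confidence,
    # so it must rank rules exactly as confidence does.
    return 2 * n_ab / n_a - 1

def lift(n, n_a, n_b, n_ab):
    return n * n_ab / (n_a * n_b)

MEASURES = {"confidence": confidence, "ganascia": ganascia, "lift": lift}

def ranks(scores):
    """Rank position of each rule (1 = highest score); assumes no ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    result = [0] * len(scores)
    for position, rule_index in enumerate(order, start=1):
        result[rule_index] = position
    return result

def spearman(r1, r2):
    """Spearman rank correlation for two tie-free rank vectors."""
    n = len(r1)
    d2 = sum((a - b) ** 2 for a, b in zip(r1, r2))
    return 1 - 6 * d2 / (n * (n * n - 1))

def ranking_equivalence_classes(rankings):
    """Group measure names whose rank vectors are identical."""
    groups = {}
    for name, r in rankings.items():
        groups.setdefault(tuple(r), []).append(name)
    return list(groups.values())

# One rank vector per measure, computed over the same set of rules.
RANKINGS = {
    name: ranks([measure(*rule) for rule in RULES])
    for name, measure in MEASURES.items()
}

if __name__ == "__main__":
    for a, b in combinations(RANKINGS, 2):
        print(a, b, round(spearman(RANKINGS[a], RANKINGS[b]), 3))
    print(ranking_equivalence_classes(RANKINGS))
```

Identical rankings on one small rule set only suggest an equivalence; the abstract reports that the equivalences it highlights are also proven formally. A distance such as one minus the Spearman correlation could serve as input to a clustering algorithm, which is one plausible reading of "clustering based on ranking behavior".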
Pages: 1004-1045
Page count: 42