Visual comparison of association rules

被引:15
作者
Hofmann, H [1 ]
Wilhelm, A [1 ]
机构
[1] Univ Augsburg, Inst Math, D-86135 Augsburg, Germany
关键词
association rules; confidence; double-decker plots; support; visualization of association rules;
D O I
10.1007/s001800100075
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Rule induction methods are widely applied tools for mining large data bases. They are often used as a starting point in undirected data mining, i.e. when you do not know what specific patterns to look for. One form of rule induction methods are association rules which have their origin in market basket analysis. Since an evaluation of their results is often hard due to the mass of output, pruning methods are needed to turn the output of association rules into a manageable number of patterns. We will present some statistical measures and their depictions that are useful to assess the quality of an association rule. We will show plots portraying confidence and support for individual rules as well as for sets of rules as alternatives to displays currently used in commercial software. A new quality measure for association rules, the doc, will be introduced which overcomes some of the problems of support and confidence of association rules and can be visualised even for hundreds of rules in one diagram.
引用
收藏
页码:399 / 415
页数:17
相关论文
共 9 条
  • [1] AGRAWAL R, P ACM SIGMOD 1993, P207
  • [2] Bayardo R.J., 1999, P 5 ACM SIGKDD INT C, P145, DOI [10.1145/312129.312219, DOI 10.1145/312129.312219]
  • [3] Hofmann H., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P227, DOI 10.1145/347090.347133
  • [4] HOFMANN H, 2000, COMPSTAT 2000 P COMP, P499
  • [5] KLEMETTINEN M, 1994, P 3 INT C INF KNOWL, P401
  • [6] Piatetsky-Shapiro G., 1991, Knowledge discovery in databases, P229
  • [7] Piatetsky-Shapiro G., 1999, P 5 ACM SIGKDD INT C, P185, DOI [10.1145/312129.312225, DOI 10.1145/312129.312225]
  • [8] Srikant R, 1995, MINING GEN ASS RULES
  • [9] TOIVONEN H, 1995, MLNET WORKSH STAT MA, P47