Mining Pareto-optimal rules with respect to support and confirmation or support and anti-support

被引:26
作者
Brzezinska, Izabela
Greco, Salvatore
Slowinski, Roman
机构
[1] Poznan Univ Tech, Inst Comp Sci, PL-60965 Poznan, Poland
[2] Univ Catania, Fac Econ, I-95129 Catania, Italy
[3] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
关键词
knowledge discovery; association rules; decision rules; Bayesian confirmation measures; monotonicity in rule support; confidence and confirmation; pareto-optimal rules;
D O I
10.1016/j.engappai.2006.11.015
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
in knowledge discovery and data mining many measures of interestingness have been proposed in order to measure the relevance and utility of the discovered patterns. Among these measures, an important role is played by Bayesian confirmation measures, which express in what degree a premise confirms a conclusion. In this paper, we are considering knowledge patterns in a form of "if..., then..." rules with a fixed conclusion. We investigate a monotone link between Bayesian confirmation measures, and classic dimensions being rule support and confidence. In particular, we formulate and prove conditions for monotone dependence of two confirmation measures enjoying some desirable properties on rule support and confidence. As the confidence measure is unable to identify and eliminate noninteresting rules, for which a premise does not confirm a conclusion, we propose to substitute the confidence for one of the considered confirmation measures in mining the Pareto-optimal rules. We also provide general conclusions for the monotone link between any confirmation measure enjoying the desirable properties and rule support and confidence. Finally, we propose to mine rules maximizing rule support and minimizing rule anti-support, which is the number of examples, which satisfy the premise of the rule but not its conclusion (called counter-examples of the considered rule). We prove that in this way we are able to mine all the rules maximizing any confirmation measure enjoying the desirable properties. We also prove that this Pareto-optimal set includes all the rules from the previously considered Pareto-optimal borders. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:587 / 600
页数:14
相关论文
共 24 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
[Anonymous], P 2 INT C KNOWL DISC
[3]  
[Anonymous], 1962, LOGICAL FDN PROBABIL
[4]  
Bayardo RJ, KDD 99 P 5 ACM SIGKD, DOI [10.1145/312129.312219, DOI 10.1145/312129.312219]
[5]  
Brin S., 1997, SIGMOD Record, V26, P265, DOI [10.1145/253262.253325, 10.1145/253262.253327]
[6]  
BRZEZINSKA I, 2005, RECENT DEV ARTIFICIA, P39
[7]  
BRZEZINSKA I, 2005, RA02505 POZN U TECHN
[8]   Measuring confirmation (Theory of rational belief) [J].
Christensen, D .
JOURNAL OF PHILOSOPHY, 1999, 96 (09) :437-461
[9]  
Clark P., 1991, MACHINE LEARNING EWS, P151, DOI 10.1007/bfb0017011
[10]  
Eeills E, 2002, PHILOS STUD, V107, P129