Cubegrades: Generalizing association rules

被引:37
作者
Imielinski, T [1 ]
Khachiyan, L [1 ]
Abdulghani, A [1 ]
机构
[1] Rutgers State Univ, Dept Comp Sci, Piscataway, NJ 08854 USA
基金
美国国家科学基金会;
关键词
database mining; cubegrades; association rules; cube generation; OLAP;
D O I
10.1023/A:1015417610840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cubegrades are a generalization of association rules which represent how a set of measures (aggregates) is affected by modifying a cube through specialization (rolldown), generalization (rollup) and mutation (which is a change in one of the cube's dimensions). Cubegrades are significantly more expressive than association rules in capturing trends and patterns in data because they can use other standard aggregate measures, in addition to COUNT. Cubegrades are atoms which can support sophisticated "what if" analysis tasks dealing with behavior of arbitrary aggregates over different database segments. As such, cubegrades can be useful in marketing, sales analysis, and other typical data mining applications in business. In this paper we introduce the concept of cubegrades. We define them and give examples of their usage. We then describe in detail an important task for computing cubegrades: generation of significant cubes which is analogous to generating frequent sets. A novel Grid Based Pruning (GBP) method is employed for this purpose. We experimentally demonstrate the practicality of the method. We conclude with a number of open questions and possible extensions of the work.
引用
收藏
页码:219 / 257
页数:39
相关论文
共 22 条
[1]  
ABDULGHANI A, UNPUB DATA MINING KN
[2]  
Agarwal S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P506
[3]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[4]  
Agrawal R., 1996, Advances in Knowledge Discovery and Data Mining, P307
[5]  
Agrawal R., 1994, P 20 INT C VER LARG, V1215, P487
[6]  
[Anonymous], P ACM SIGMOD 98
[7]   A multi-level view model for secure object-oriented databases [J].
BaraaniDastjerdi, A ;
Pieprzyk, J ;
SafaviNaini, R .
DATA & KNOWLEDGE ENGINEERING, 1997, 23 (02) :97-117
[8]  
BASU S, 1997, FDN COMPUTER SCI FOC
[9]  
Beyer K, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P359, DOI 10.1145/304181.304214
[10]   Data cube: A relational aggregation operator generalizing group-by, cross-tab, and sub-totals [J].
Gray, J ;
Bosworth, A ;
Layman, A ;
Pirahesh, H .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :152-159