Efficient quantitative frequent pattern mining using predicate trees

被引:0
作者
Wang, BY [1 ]
Pan, F [1 ]
Cui, Y [1 ]
Perrizo, W [1 ]
机构
[1] N Dakota State Univ, Dept Comp Sci, Fargo, ND 58105 USA
来源
COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING | 2003年
关键词
ARM; frequent pattern mining; quantitative frequent pattern; predicate trees;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Databases information is not limited to categorical attributes, but also contains much quantitative data. The typical approach of quantitative association rule mining is achieved by using intervals. Such approach involves many interval generations and merges, and requires reconstruction of tree structures, such as hash trees, R-trees, and FP-trees. When intervals change, these tree structures are re-constructed on-the-fly, which is very time-consuming. In this paper, we present a Predicate tree based quantitative frequent pattern mining algorithm (PQM). The central idea of PQM is to exploit Predicate trees to facilitate quantitative frequent mining without tree re-construction. Our method has three major advantages: 1) P-trees are pre-generated tree structures, which are flexible and efficient for any data partition and any interval optimization; 2) PQM is efficient by using fast P-tree logic operations; and 3) PQM has better performance due to the vertically decomposed structure and compression of P-trees. Experiments show that our algorithm outperforms Apriori algorithm by orders of magnitude with better support threshold scalability and cardinality scalability.
引用
收藏
页码:168 / 171
页数:4
相关论文
共 10 条
  • [1] Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
  • [2] AGRAWAL R, P VLDB 94, P487
  • [3] Aumann Y., 1999, P 5 ACM SIGKDD INT C, P261
  • [4] Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
  • [5] DING Q, 2002, P PAKDD2002 TAIP TAI
  • [6] HAN J, 2000, P ACM SIGMOD C MAN D
  • [7] PARK JS, 1995, P ACM SIGMOD C MAN D
  • [8] PERRIZO W, 2001, NDSUCSORTR011
  • [9] Srikant R., 1996, P ACM SIGMOD C MAN D
  • [10] *TIFF, TIFF IM DAT SETS