Sampling large databases for association rules

Author
Toivonen, H
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Discovery of association rules is an important database mining problem. Current algorithms for finding association rules require several passes over the analyzed database, and obviously the role of I/O overhead is very significant for very large databases. We present new algorithms that reduce the database activity considerably. The idea is to pick a random sample, to find using this sample all association rules that probably hold in the whole database, and then to verify the results with the rest of the database. The algorithms thus produce exact association rules, not approximations based on a sample. The approach is, however, probabilistic, and in those rare cases where our sampling method does not produce all association rules, the missing rules can be found in a second pass. Our experiments show that the proposed algorithms can find association rules very efficiently in only one database pass.
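The sample-then-verify idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give implementation details, so several things here are assumptions: the function names (`frequent_itemsets`, `negative_border`, `sample_then_verify`), the use of a lowered support threshold on the sample, the use of the negative border (minimal itemsets not frequent in the sample) as the extra candidates checked in the full pass, and the simple level-wise search run on the sample. It covers only the frequent-itemset step, not rule generation, and holds the database in memory for brevity.

```python
import random
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Level-wise search; returns {itemset: support fraction}."""
    n = len(transactions)
    items = sorted({i for t in transactions for i in t})
    level = [frozenset([i]) for i in items]
    result = {}
    while level:
        counts = {c: sum(1 for t in transactions if c <= t) for c in level}
        kept = {c: k / n for c, k in counts.items() if k / n >= min_support}
        result.update(kept)
        # Next candidates: one item larger, with every subset kept at this level.
        next_level = set()
        for a, b in combinations(list(kept), 2):
            u = a | b
            if len(u) == len(a) + 1 and all(
                frozenset(s) in kept for s in combinations(u, len(a))
            ):
                next_level.add(u)
        level = list(next_level)
    return result


def negative_border(frequent, items):
    """Minimal itemsets outside `frequent` whose proper subsets are all inside."""
    border = {frozenset([i]) for i in items if frozenset([i]) not in frequent}
    for a, b in combinations(list(frequent), 2):
        u = a | b
        if len(a) == len(b) and len(u) == len(a) + 1 and u not in frequent:
            if all(frozenset(s) in frequent for s in combinations(u, len(a))):
                border.add(u)
    return border


def sample_then_verify(db, min_support, sample_size, lowered_support, seed=0):
    """Mine a random sample, then verify candidates in one pass over db."""
    rng = random.Random(seed)
    sample = rng.sample(db, sample_size)
    # Assumption: a lowered threshold on the sample reduces the chance of misses.
    candidates = set(frequent_itemsets(sample, lowered_support))
    # Assumption: the item universe is known; here it is read from db for brevity.
    items = {i for t in db for i in t}
    border = negative_border(candidates, items)

    # Single pass over the full database: exact counts for all candidates.
    counts = {c: 0 for c in candidates | border}
    for t in db:
        for c in counts:
            if c <= t:
                counts[c] += 1
    n = len(db)
    frequent = {c: k / n for c, k in counts.items() if k / n >= min_support}

    # A frequent border set means some frequent itemsets may have been missed,
    # which corresponds to the rare case needing a second pass.
    second_pass_needed = any(c in frequent for c in border)
    return frequent, second_pass_needed
```

The supports returned are exact because they are counted on the whole database; the sample only decides which itemsets get counted. The second return value flags the case in which a further pass would be required.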
Pages: 134-145
Page count: 12
Related papers
50 in total
  • [31] New sampling method for mining association rules on very large database
    Zhang, Z.G. (zhangzhaogong@0451.com), 2001, Chinese Academy of Sciences (12):
  • [32] Discovery of association rules in temporal databases
    Tansel, Abdullah Uz
    Imberman, Susan P.
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 371 - +
  • [33] Mining dynamic association rules in databases
    Liu, JF
    Rong, G
    COMPUTATIONAL INTELLIGENCE AND SECURITY, PT 1, PROCEEDINGS, 2005, 3801 : 688 - 695
  • [34] Mining a complete set of both positive and negative association rules from large databases
    Wang, Hao
    Zhang, Xing
    Chen, Guoqing
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 777 - 784
  • [35] Efficient Graph-Based Algorithms for Discovering and Maintaining Association Rules in Large Databases
    Guanling Lee
    K. L. Lee
    Arbee L. P. Chen
    Knowledge and Information Systems, 2001, 3 (3) : 338 - 355
  • [36] An Efficient Approach for Mining Positive and Negative Association Rules from Large Transactional Databases
    Kishor, Peddi
    Porika, Sammulal
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 85 - 89
  • [37] An efficient algorithm for mining quantitative association rules to raise reliance of data in large databases
    Lee, HJ
    Park, WH
    Park, DS
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 672 - 681
  • [38] Probabilistic approach to association rules in incomplete databases
    Kryszkiewicz, M
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2000, 1846 : 133 - 138
  • [39] Efficient mining of association rules in text databases
    Holt, JD
    Chung, SM
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON INFORMATION KNOWLEDGE MANAGEMENT, CIKM'99, 1999, : 234 - 242
  • [40] Association rules mining in vertically partitioned databases
    Rozenberg, Boris
    Gudes, Ehud
    DATA & KNOWLEDGE ENGINEERING, 2006, 59 (02) : 378 - 396