Efficient incremental mining of contrast patterns in changing data

被引:11
|
作者
Bailey, James [1 ]
Loekito, Elsa [1 ]
机构
[1] Univ Melbourne, Dept Comp Sci & Software Engn, Melbourne, Vic 3010, Australia
关键词
Data mining; Contrast patterns; Databases;
D O I
10.1016/j.ipl.2009.10.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A contrast pattern is a set of items (itemset) whose frequency differs significantly between two classes of data. Such patterns describe distinguishing characteristics between datasets, are meaningful to human experts, have strong discriminating ability and can be used for powerful classifiers. Incrementally mining such patterns is very important for evolving datasets, where transactions can be either inserted or deleted and mining needs to be repeated after changes occur. When the change is small, it is undesirable to carry out mining from scratch. Rather, the set of previously mined contrast patterns should be reused where possible to compute the new patterns. A primary example of evolving data is a data stream, where the data is a sequence of continuously arriving transactions (or itemsets). in this paper, we propose an efficient technique for incrementally mining contrast patterns. Our algorithm particularly aims to avoid redundant computation which might occur due to simultaneous transaction insertion and deletion, as is the case for data streams. in an experimental study using real and synthetic data streams, we show our algorithm can be substantially faster than the previous approach. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:88 / 92
页数:5
相关论文
共 50 条
  • [1] An efficient algorithm for incremental mining of sequential patterns
    Ren, Jia-Dong
    Zhou, Xiao-Lei
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 179 - 188
  • [2] Incremental mining of association patterns on compressed data
    Ng, VTY
    Wong, JML
    Bao, P
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 441 - 446
  • [3] An efficient incremental algorithm for mining web traversal patterns
    Yen, SJ
    Lee, YS
    Hsieh, MC
    ICEBE 2005: IEEE INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING, PROCEEDINGS, 2005, : 274 - 281
  • [4] A fuzzy data mining algorithm for incremental mining of quantitative sequential patterns
    Subramanyam, RBV
    Goswami, A
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2005, 13 (06) : 633 - 652
  • [5] CanTree: A tree structure for efficient incremental mining of frequent patterns
    Leung, CKS
    Khan, QI
    Hoque, T
    Fifth IEEE International Conference on Data Mining, Proceedings, 2005, : 274 - 281
  • [6] A novel approach for mining frequent patterns from incremental data
    Jindal, Rajni
    Borah, Malaya Dutta
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (03) : 244 - 264
  • [7] Incremental mining of closed sequential patterns in multiple data streams
    Yang S.-Y.
    Chao C.-M.
    Chen P.-Z.
    Sun C.-H.
    Journal of Networks, 2011, 6 (05) : 728 - 735
  • [8] An efficient incremental algorithm for mining web navigation patterns with dynamic thresholds
    Ying, Jia-Ching
    Tseng, Vincent S.
    ICIC Express Letters, 2010, 4 (05): : 1625 - 1630
  • [9] Efficient monitoring of patterns in data mining environments
    Baron, S
    Spiliopoulou, M
    Günther, O
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2003, 2798 : 253 - 265
  • [10] Efficient data mining for path traversal patterns
    Chen, MS
    Park, JS
    Yu, PS
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (02) : 209 - 221