SPPC: a new tree structure for mining erasable patterns in data streams

Authors
Tuong Le
Bay Vo
Philippe Fournier-Viger
Mi Young Lee
Sung Wook Baik
Affiliations
[1] Sejong University, Digital Contents Research Institute
[2] Ton Duc Thang University, Division of Data Science
[3] Ton Duc Thang University, Faculty of Information Technology
[4] Harbin Institute of Technology (Shenzhen), School of Natural Sciences and Humanities
Source
Applied Intelligence | 2019 / Vol. 49
Keywords
Data mining; Data streams; Erasable patterns; Sliding window
DOI
Not available
Abstract
Discovering erasable patterns (EPs) consists of identifying product parts that would cause only a small profit loss if their production were stopped. This data mining problem has attracted the attention of numerous researchers in recent years because EPs can be used to reduce manufacturers' profit loss. Although many algorithms have been designed to mine EPs, an important limitation of state-of-the-art EP mining algorithms is that they are batch algorithms, that is, they are designed to be applied to static databases. In real-life applications, however, databases are dynamic, as they are constantly updated by adding or removing products and parts. To stay informed about EPs in real time, traditional EP mining algorithms must be applied over and over again to a database. This is inefficient because those algorithms always start from scratch, without taking advantage of results generated by previous executions. To address this drawback of previous work in handling real-life dynamic data, this paper proposes an efficient algorithm named MSPPC for mining EPs in data streams. It relies on a novel tree structure named the SPPC (Streaming Pre-Post Code) tree, which extends the WPPC tree structure to maintain a compact tree representation of EPs in a data stream. Experimental results show that the designed MSPPC algorithm outperforms the state-of-the-art batch MERIT and dMERIT algorithms when they are run in batch mode using a sliding window. In addition, the proposed algorithm is also faster than the state-of-the-art algorithms for mining EPs, namely MERIT, dMERIT+, MEI and EIFDD.
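To make the problem setting in the abstract concrete, the following is a minimal brute-force sketch of the naive baseline it criticizes: re-mining every sliding window from scratch. It is not MSPPC and does not use the SPPC tree. It assumes the definition commonly used in the EP literature, which the abstract does not spell out: the gain of an itemset is the total profit of the products that use at least one of its items, and the itemset is erasable when its gain does not exceed a threshold fraction of the window's total profit. The function names, the threshold value and the toy stream are all hypothetical.

```python
from collections import deque
from itertools import combinations


def gain(itemset, products):
    """Total profit of the products that use at least one item of `itemset`."""
    return sum(profit for items, profit in products if itemset & items)


def erasable_patterns(products, threshold):
    """Brute force: every itemset whose gain is <= threshold * total profit.

    Assumed definition of an erasable pattern; exponential in the number of
    items, so suitable only for toy data.
    """
    total = sum(profit for _, profit in products)
    all_items = sorted(set().union(*(items for items, _ in products)))
    result = {}
    for size in range(1, len(all_items) + 1):
        for combo in combinations(all_items, size):
            candidate = frozenset(combo)
            g = gain(candidate, products)
            if g <= threshold * total:
                result[candidate] = g
    return result


def mine_stream(stream, window_size, threshold):
    """Naive sliding window: re-mine from scratch each time the window is full."""
    window = deque(maxlen=window_size)  # the oldest product drops out automatically
    for product in stream:
        window.append(product)
        if len(window) == window_size:
            yield erasable_patterns(list(window), threshold)


# Hypothetical toy stream: (set of parts used by the product, product profit).
stream = [
    (frozenset("ab"), 100),
    (frozenset("bc"), 50),
    (frozenset("cd"), 30),
    (frozenset("d"), 20),
]
results = list(mine_stream(stream, window_size=3, threshold=0.3))
for index, patterns in enumerate(results, start=1):
    print(index, {"".join(sorted(k)): v for k, v in patterns.items()})
```

On this toy stream the first window reports part d as erasable, while the second window reports no erasable pattern. Each window repeats the full enumeration, which is the redundant work the abstract says an incremental structure is meant to avoid.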
Pages: 478-495
Page count: 17
Related papers
50 items in total
  • [31] Mining Multi-Relational Frequent Patterns in Data Streams
    Hou, Wei
    Yang, Bingru
    Xie, Yonghong
    Wu, Chensheng
    2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, : 205 - 209
  • [32] Mining Popular Patterns: A Novel Mining Problem and Its Application to Static Transactional Databases and Dynamic Data Streams
    Cuzzocrea, Alfredo
    Jiang, Fan
    Leung, Carson K.
    Liu, Dacheng
    Peddle, Aaron
    Tanbeer, Syed K.
    TRANSACTIONS ON LARGE-SCALE DATA- AND KNOWLEDGE-CENTERED SYSTEMS XXI, 2015, 9260 : 115 - 139
  • [33] New algorithm for Frequent Itemsets Mining From Evidential Data Streams
    Farhat, Amine
    Gouider, Mohamed Salah
    Ben Said, Lamjed
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS: PROCEEDINGS OF THE 20TH INTERNATIONAL CONFERENCE KES-2016, 2016, 96 : 645 - 653
  • [34] Efficient approach for incremental weighted erasable pattern mining with list structure
    Nam, Hyoju
    Yun, Unil
    Yoon, Eunchul
    Lin, Jerry Chun-Wei
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 143
  • [35] A Sliding Window-Based Approach for Mining Frequent Weighted Patterns Over Data Streams
    Bui, Huong
    Nguyen-Hoang, Tu-Anh
    Vo, Bay
    Nguyen, Ham
    Le, Tuong
    IEEE ACCESS, 2021, 9 : 56318 - 56329
  • [36] A Landmark-Model Based System for Mining Frequent Patterns from Uncertain Data Streams
    Leung, Carson Kai-Sang
    Jiang, Fan
    Hayduk, Yaroslav
    PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 249 - 250
  • [37] An efficient algorithm for mining maximal frequent patterns over data streams
    Yang, Junrui
    Wei, Yanjun
    Zhou, Fenfen
    2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,
  • [38] Mining maximal frequent itemsets in data streams based on FP-Tree
    Ao, Fujiang
    Yan, Yuejin
    Huang, Jian
    Huang, Kedi
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS, 2007, 4571 : 479 - +
  • [39] Tree-Based Unified Temporal Erasable-Itemset Mining
    Hong, Tzung-Pei
    Li, Jia-Xiang
    Tsai, Yu-Chuan
    Huang, Wei-Ming
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2023, PT I, 2023, 13995 : 224 - 233
  • [40] A three-phase approach to differentially private crucial patterns mining over data streams
    Wang, Jinyan
    Liu, Chen
    Fu, Xingcheng
    Luo, Xudong
    Li, Xianxian
    COMPUTERS & SECURITY, 2019, 82 : 30 - 48