An Effective Method for Mining Negative Sequential Patterns From Data Streams

Cited by: 2
Authors
Zhang, Nannan [1]
Ren, Xiaoqiang [1]
Dong, Xiangjun [1]
Affiliations
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
DOI
10.1109/ACCESS.2023.3262823
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Traditional negative sequential pattern (NSP) mining algorithms are designed for static datasets, which are stored on a device and can be scanned many times. Today, many applications produce large amounts of data at very high speed, known as a data stream. Unlike static data, a data stream is transient and can usually be read only once, so traditional NSP mining algorithms cannot be applied to it directly. The key reasons are: (1) an inefficient method for generating negative sequential candidates, (2) one-time mining, and (3) a lack of real-time processing. To solve this problem, this paper proposes a new algorithm for mining NSPs from data streams, called nsp-DS. First, we present a method that generates positive and negative sequential candidates simultaneously, together with a new definition of negative containment. Second, we use a sliding window to store the sample data of the current time; continuous mining of the entire data stream is achieved by continuously replacing old data with new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user issues a request, the algorithm traverses the prefix tree and outputs the sequential patterns. The experimental results show that nsp-DS can discover NSPs from data streams.
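The sliding-window and prefix-tree ideas mentioned in the abstract can be illustrated with a minimal sketch. This is not the nsp-DS algorithm: it handles no negative patterns or candidate generation, and it counts only the prefixes of each stored sequence, a much simpler notion than the paper's containment definitions. The class and method names (`SlidingWindowPrefixTree`, `add`, `patterns`) and the `min_count` threshold are hypothetical and chosen for illustration only.

```python
from collections import deque


class PrefixTreeNode:
    """One node of the prefix tree; count = window sequences sharing this prefix."""

    def __init__(self):
        self.children = {}
        self.count = 0


class SlidingWindowPrefixTree:
    """Illustrative only: keeps the last `window_size` sequences in a prefix tree."""

    def __init__(self, window_size):
        self.window_size = window_size
        self.window = deque()
        self.root = PrefixTreeNode()

    def _update(self, sequence, delta):
        # Walk the path for `sequence`, adjusting the count of every prefix.
        node, path = self.root, []
        for item in sequence:
            child = node.children.get(item)
            if child is None:
                child = node.children[item] = PrefixTreeNode()
            child.count += delta
            path.append((node, item, child))
            node = child
        # Prune nodes whose count dropped to zero, deepest first.
        for parent, item, child in reversed(path):
            if child.count == 0:
                del parent.children[item]

    def add(self, sequence):
        # Evict the oldest sequence once the window is full, then insert the new one.
        sequence = tuple(sequence)
        if len(self.window) == self.window_size:
            self._update(self.window.popleft(), -1)
        self.window.append(sequence)
        self._update(sequence, +1)

    def patterns(self, min_count):
        # Traverse the tree on request and collect prefixes meeting the threshold.
        found, stack = [], [((), self.root)]
        while stack:
            prefix, node = stack.pop()
            for item, child in node.children.items():
                if child.count >= min_count:
                    pattern = prefix + (item,)
                    found.append((pattern, child.count))
                    stack.append((pattern, child))
        return sorted(found)
```

Because a prefix's count can never be smaller than the count of any of its extensions, the traversal in `patterns` can skip a whole subtree as soon as one node falls below `min_count`.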
Pages: 31842-31854
Number of pages: 13