An Effective Method for Mining Negative Sequential Patterns From Data Streams

被引:4
作者
Zhang, Nannan [1 ]
Ren, Xiaoqiang [1 ]
Dong, Xiangjun [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
D O I
10.1109/ACCESS.2023.3262823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional negative sequential patterns(NSPs) mining algorithms are used to mine static dataset which are stored in equipment and can be scanned many times. Nowadays, with the development of technology, many applications produce a large amount of data at a very high speed, which is called as data stream. Unlike static data, data stream is transient and can usually be read only once. So, traditional NSP mining algorithm cannot be directly applied to data stream. Briefly, the key reasons are: (1) inefficient negative sequential candidates generation method, (2) one-time mining, (3) lack of real-time processing. To solve this problem, this paper proposed a new algorithm mining NSP from data stream, called nsp-DS. First, we present a method to generate positive and negative sequential candidates simultaneously, and a new negative containment definition. Second, we use a sliding window to store sample data in current time. The continuous mining of entire data stream is realized through the continuous replacement of old and new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user requests, it traverses the prefix tree to output sequential patterns. The experimental results show that nsp-DS may discover NSPs from data streams.
引用
收藏
页码:31842 / 31854
页数:13
相关论文
共 50 条
[21]   NSPIS: Mining Negative Sequential Patterns with Individual Support [J].
Huang, Gengsen ;
Gan, Wensheng ;
Huang, Shan ;
Chen, Jiahui ;
Chen, Chien-Ming .
2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, :5507-5516
[22]   SPPC: a new tree structure for mining erasable patterns in data streams [J].
Le, Tuong ;
Vo, Bay ;
Fournier-Viger, Philippe ;
Lee, Mi Young ;
Baik, Sung Wook .
APPLIED INTELLIGENCE, 2019, 49 (02) :478-495
[23]   An Efficient Approach for Mining Frequent Patterns over Uncertain Data Streams [J].
Shajib, Md. Badi-Uz-Zaman ;
Samiullah, Md. ;
Ahmed, Chowdhury Farhan ;
Leung, Carson K. ;
Pazdor, Adam G. M. .
2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, :980-984
[24]   Hyper-structure mining of frequent patterns in uncertain data streams [J].
HewaNadungodage, Chandima ;
Xia, Yuni ;
Lee, Jaehwan John ;
Tu, Yi-cheng .
KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) :219-244
[25]   An efficient algorithm for mining maximal frequent patterns over data streams [J].
Yang, Junrui ;
Wei, Yanjun ;
Zhou, Fenfen .
2015 7TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS IHMSC 2015, VOL II, 2015,
[26]   SPPC: a new tree structure for mining erasable patterns in data streams [J].
Tuong Le ;
Bay Vo ;
Philippe Fournier-Viger ;
Mi Young Lee ;
Sung Wook Baik .
Applied Intelligence, 2019, 49 :478-495
[27]   Hyper-structure mining of frequent patterns in uncertain data streams [J].
Chandima HewaNadungodage ;
Yuni Xia ;
Jaehwan John Lee ;
Yi-cheng Tu .
Knowledge and Information Systems, 2013, 37 :219-244
[28]   Data mining on time series of sequential patterns [J].
Visa, A .
DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY IV, 2002, 4730 :166-171
[29]   A MapReduce solution for incremental mining of sequential patterns from big data [J].
Saleti, Sumalatha ;
Subramanyam, R. B., V .
EXPERT SYSTEMS WITH APPLICATIONS, 2019, 133 :109-125
[30]   EclatDS: An efficient sliding window based frequent pattern mining method for data streams [J].
Deypir, Mahmood ;
Sadreddini, Mohammad Hadi .
INTELLIGENT DATA ANALYSIS, 2011, 15 (04) :571-587