An Effective Method for Mining Negative Sequential Patterns From Data Streams

被引:4
作者
Zhang, Nannan [1 ]
Ren, Xiaoqiang [1 ]
Dong, Xiangjun [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
D O I
10.1109/ACCESS.2023.3262823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional negative sequential patterns(NSPs) mining algorithms are used to mine static dataset which are stored in equipment and can be scanned many times. Nowadays, with the development of technology, many applications produce a large amount of data at a very high speed, which is called as data stream. Unlike static data, data stream is transient and can usually be read only once. So, traditional NSP mining algorithm cannot be directly applied to data stream. Briefly, the key reasons are: (1) inefficient negative sequential candidates generation method, (2) one-time mining, (3) lack of real-time processing. To solve this problem, this paper proposed a new algorithm mining NSP from data stream, called nsp-DS. First, we present a method to generate positive and negative sequential candidates simultaneously, and a new negative containment definition. Second, we use a sliding window to store sample data in current time. The continuous mining of entire data stream is realized through the continuous replacement of old and new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user requests, it traverses the prefix tree to output sequential patterns. The experimental results show that nsp-DS may discover NSPs from data streams.
引用
收藏
页码:31842 / 31854
页数:13
相关论文
共 50 条
[31]   Mining maximal frequent itemsets from data streams [J].
Mao, Guojun ;
Wu, Xindong ;
Zhu, Xingquan ;
Chen, Gong ;
Liu, Chunnian .
JOURNAL OF INFORMATION SCIENCE, 2007, 33 (03) :251-262
[32]   Mining the frequent patterns in an arbitrary sliding window over online data streams [J].
Li, Guo-Hui ;
Chen, Hui .
Ruan Jian Xue Bao/Journal of Software, 2008, 19 (10) :2585-2596
[33]   Damped sliding based mining of average utility-driven sequential patterns over uncertain data streams [J].
Zhang, Ruihua ;
Han, Meng ;
He, Feifei ;
Meng, Fanxing ;
Li, Chunpeng .
NEUROCOMPUTING, 2025, 647
[34]   EFFICIENTLY MINING RECENT FREQUENT PATTERNS OVER ONLINE TRANSACTIONAL DATA STREAMS [J].
Chen, Hui .
INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2009, 19 (05) :707-725
[35]   Mining maximal frequent patterns by considering weight conditions over data streams [J].
Yun, Unil ;
Lee, Gangin ;
Ryu, Keun Ho .
KNOWLEDGE-BASED SYSTEMS, 2014, 55 :49-65
[36]   Mining top-k high utility patterns over data streams [J].
Zihayat, Morteza ;
An, Aijun .
INFORMATION SCIENCES, 2014, 285 :138-161
[37]   An Efficient Algorithm for Mining Frequent Patterns over High Speed Data Streams [J].
Meng, Cai-xia .
2009 WRI WORLD CONGRESS ON SOFTWARE ENGINEERING, VOL 1, PROCEEDINGS, 2009, :319-323
[38]   Mining Top-K Sequential Patterns in the Data Stream Environment [J].
Dai, Bi-Ru ;
Jiang, Hung-Lin ;
Chung, Chih-Heng .
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, :142-149
[39]   SPAMS: A Novel Incremental Approach for Sequential Pattern Mining in Data Streams [J].
Vinceslas, Lionel ;
Symphor, Jean-Emile ;
Mancheron, Alban ;
Poncelet, Pascal .
ADVANCES IN KNOWLEDGE DISCOVERY AND MANAGEMENT, 2010, 292 :201-216
[40]   PSP-AMS: Progressive Mining of Sequential Patterns Across Multiple Streams [J].
Jaysawal, Bijay Prasad ;
Huang, Jen-Wei .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2019, 13 (01)