An Effective Method for Mining Negative Sequential Patterns From Data Streams

被引:2
作者
Zhang, Nannan [1 ]
Ren, Xiaoqiang [1 ]
Dong, Xiangjun [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
D O I
10.1109/ACCESS.2023.3262823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional negative sequential patterns(NSPs) mining algorithms are used to mine static dataset which are stored in equipment and can be scanned many times. Nowadays, with the development of technology, many applications produce a large amount of data at a very high speed, which is called as data stream. Unlike static data, data stream is transient and can usually be read only once. So, traditional NSP mining algorithm cannot be directly applied to data stream. Briefly, the key reasons are: (1) inefficient negative sequential candidates generation method, (2) one-time mining, (3) lack of real-time processing. To solve this problem, this paper proposed a new algorithm mining NSP from data stream, called nsp-DS. First, we present a method to generate positive and negative sequential candidates simultaneously, and a new negative containment definition. Second, we use a sliding window to store sample data in current time. The continuous mining of entire data stream is realized through the continuous replacement of old and new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user requests, it traverses the prefix tree to output sequential patterns. The experimental results show that nsp-DS may discover NSPs from data streams.
引用
收藏
页码:31842 / 31854
页数:13
相关论文
共 50 条
[41]   Applications of Concurrent Sequential Patterns in Protein Data Mining [J].
Wang, Cuiqing ;
Lu, Jing ;
Keech, Malcolm .
MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 :243-257
[42]   A data mining approach to discovering reliable sequential patterns [J].
Shyur, Huan-Jyh ;
Jou, Chichang ;
Chang, Keng .
JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (08) :2196-2203
[43]   A fuzzy data mining algorithm for finding sequential patterns [J].
Hu, YC ;
Chen, RS ;
Tzeng, GH ;
Shieh, JH .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2003, 11 (02) :173-193
[44]   Online mining abnormal period patterns from multiple medical sensor data streams [J].
Huang, Guangyan ;
Zhang, Yanchun ;
Cao, Jie ;
Steyn, Michael ;
Taraporewalla, Kersi .
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (04) :569-587
[45]   WSFI-Mine: Mining Frequent Patterns in Data Streams [J].
Kim, Younghee ;
Kim, Ungmo .
ADVANCES IN NEURAL NETWORKS - ISNN 2009, PT 2, PROCEEDINGS, 2009, 5552 :845-852
[46]   Online mining abnormal period patterns from multiple medical sensor data streams [J].
Guangyan Huang ;
Yanchun Zhang ;
Jie Cao ;
Michael Steyn ;
Kersi Taraporewalla .
World Wide Web, 2014, 17 :569-587
[47]   Mining frequent closed patterns with item constraints in data streams [J].
Hu, Wei-Cheng ;
Wang, Ben-Nian ;
Cheng, Zhuan-Liu .
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, :274-280
[48]   Mining Multi-Relational Frequent Patterns in Data Streams [J].
Hou, Wei ;
Yang, Bingru ;
Xie, Yonghong ;
Wu, Chensheng .
2009 INTERNATIONAL CONFERENCE ON BUSINESS INTELLIGENCE AND FINANCIAL ENGINEERING, PROCEEDINGS, 2009, :205-209
[49]   Sliding-Window Based Method to Discover High Utility Patterns from Data Streams [J].
Manike, Chiranjeevi ;
Om, Hari .
COMPUTATIONAL INTELLIGENCE IN DATA MINING, VOL 3, 2015, 33
[50]   Privacy preserving data mining of sequential patterns for network traffic data [J].
Kim, Seung-Woo ;
Park, Sanghyun ;
Won, Jung-Im ;
Kim, Sang-Wook .
ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 :201-+