An Effective Method for Mining Negative Sequential Patterns From Data Streams

被引:2
|
作者
Zhang, Nannan [1 ]
Ren, Xiaoqiang [1 ]
Dong, Xiangjun [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
D O I
10.1109/ACCESS.2023.3262823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional negative sequential patterns(NSPs) mining algorithms are used to mine static dataset which are stored in equipment and can be scanned many times. Nowadays, with the development of technology, many applications produce a large amount of data at a very high speed, which is called as data stream. Unlike static data, data stream is transient and can usually be read only once. So, traditional NSP mining algorithm cannot be directly applied to data stream. Briefly, the key reasons are: (1) inefficient negative sequential candidates generation method, (2) one-time mining, (3) lack of real-time processing. To solve this problem, this paper proposed a new algorithm mining NSP from data stream, called nsp-DS. First, we present a method to generate positive and negative sequential candidates simultaneously, and a new negative containment definition. Second, we use a sliding window to store sample data in current time. The continuous mining of entire data stream is realized through the continuous replacement of old and new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user requests, it traverses the prefix tree to output sequential patterns. The experimental results show that nsp-DS may discover NSPs from data streams.
引用
收藏
页码:31842 / 31854
页数:13
相关论文
共 50 条
  • [1] Mining Regular Patterns in Data Streams
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT I, PROCEEDINGS, 2010, 5981 : 399 - 413
  • [2] Efficient mining method for retrieving sequential patterns over online data streams
    Chang, JH
    Lee, WS
    JOURNAL OF INFORMATION SCIENCE, 2005, 31 (05) : 420 - 432
  • [3] A SINGLE-SCAN ALGORITHM FOR MINING SEQUENTIAL PATTERNS FROM DATA STREAMS
    Li, Hua-Fu
    Ho, Chin-Chuan
    Chen, Hsuan-Sheng
    Lee, Suh-Yin
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (3A): : 1799 - 1820
  • [4] Mining Rare Sequential Patterns in Data Streams with a Sliding Window
    Ouyang, Weimin
    2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 1023 - 1027
  • [5] An Obstruction-Check Approach to Mining Closed Sequential Patterns in Data Streams
    Chang, Ye-In
    Li, Chia-En
    Chin, Tzu-Lin
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 521 - 530
  • [6] SPEED :: Mining maximal sequential patterns over data streams
    Raissi, Chedy
    Poncelet, Pascal
    Teisseire, Maguelonne
    2006 3RD INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2006, : 537 - 543
  • [7] Mining Patterns From Data Streams: An Overview
    Borah, Anindita
    BhabeshNath
    2017 INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC), 2017, : 371 - 376
  • [8] PTree: Mining Sequential Patterns Efficiently in Multiple Data Streams Environment
    Lee, Guanling
    Chen, Yi-Chun
    Hung, Kuo-Che
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2013, 29 (06) : 1151 - 1169
  • [9] Mining Weighted Frequent Patterns from Uncertain Data Streams
    Ovi, Jesan Ahammed
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 917 - 936
  • [10] Mining negative sequential patterns
    Lin, Nancy P.
    Chen, Hung-Jen
    Hao, Wei-Hua
    PROCEEDINGS OF THE 6TH WSEAS INTERNATIONAL CONFERENCE ON APPLIED COMPUTER SCIENCE, 2007, : 658 - +