A survey of high utility sequential patterns mining methods

被引:0
作者
Zhang, Ruihua [1 ]
Han, Meng [1 ,2 ]
He, Feifei [1 ]
Meng, Fanxing [1 ]
Li, Chunpeng [1 ]
机构
[1] North Minzu Univ, Sch Comp Sci & Engn, Yinchuan, Ningxia, Peoples R China
[2] State Ethn Affairs Commiss, Key Lab Intelligent Proc Image & Graph, Yinchuan, Ningxia, Peoples R China
基金
中国国家自然科学基金;
关键词
Survey; high utility sequential patterns; incremental data; data streams; hidden patterns; EFFICIENT ALGORITHM; PREFIXSPAN; INTERNET;
D O I
10.3233/JIFS-232107
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, there has been an increasing demand for high utility sequential pattern (HUSP) mining. Different from high utility itemset mining, the "combinatorial explosion" problem of sequence data makes it more challenging. This survey aims to provide a general, comprehensive, and structured overview of the state-of-the-art methods of HUSP from a novel perspective. Firstly, from the perspective of serial and parallel, the data structure used by the mining methods are illustrated and the pros and cons of the algorithms are summarized. In order to protect data privacy, many HUSP hiding algorithms have been proposed, which are classified into array-based, chain-based and matrix-based algorithms according to the key technologies. The hidden strategies and evaluation metrics adopted by the algorithms are summarized. Next, a taxonomy of the most common and the state-of-the-art approaches for incremental mining algorithms is presented, including tree-based and projection-based. In order to deal with the latest sequence in the data stream, the existing algorithms often use the window model to update dynamically, and the algorithms are divided into methods based on sliding windows and landmark windows for analysis. Afterwards, a summary of derived high utility sequential pattern is presented. Finally, aiming at the deficiencies of the existing HUSP research, the next work that the author plans to do is given.
引用
收藏
页码:8049 / 8077
页数:29
相关论文
共 91 条
[11]  
Buffett S, 2018, IEEE INT CONF BIG DA, P644, DOI 10.1109/BigData.2018.8622138
[12]  
Cheng S, 2015, Comput Syst Appl, V12, P228
[13]  
Chunkai Zhang, 2019, 2019 IEEE 21st International Conference on High Performance Computing and Communications
[14]  
IEEE 17th International Conference on Smart City
[15]  
IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). Proceedings, P2798, DOI 10.1109/HPCC/SmartCity/DSS.2019.00392
[16]  
Dinh DT, 2019, STUD BIG DATA, V51, P207, DOI 10.1007/978-3-030-04921-8_8
[17]   An efficient algorithm for mining periodic high-utility sequential patterns [J].
Duy-Tai Dinh ;
Bac Le ;
Fournier-Viger, Philippe ;
Van-Nam Huynh .
APPLIED INTELLIGENCE, 2018, 48 (12) :4694-4714
[18]  
Fournier-Viger Philippe, 2014, Foundations of Intelligent Systems. 21st International Symposium, ISMIS 2014. Proceedings: LNCS 8502, P83, DOI 10.1007/978-3-319-08326-1_9
[19]  
Fournier-Viger Philippe., 2017, DATA SCI PATTERN REC, V1, P54, DOI DOI 10.1007/978-3-030-04921-8_4
[20]  
Gan W.S., 2019, Ph.D. Dissertation