Mining Top-k High Average-Utility Sequential Patterns for Resource Transformation

被引:2
作者
Cao, Kai [1 ,2 ]
Duan, Yucong [3 ]
机构
[1] Hainan Univ, Sch Cyberspace Secur, Renmin Ave 58, Haikou 570228, Peoples R China
[2] Hainan Univ, Sch Cryptol, Renmin Ave 58, Haikou 570228, Peoples R China
[3] Hainan Univ, Sch Comp Sci & Technol, Renmin Ave 58, Haikou 570228, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 22期
关键词
DIKW graph; resource transformation; sequential pattern; average utility; top-k; pattern mining; EFFICIENT ALGORITHM; ITEMSETS; INFORMATION; KNOWLEDGE;
D O I
10.3390/app132212340
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
High-utility sequential pattern mining (HUSPM) helps researchers find all subsequences that have high utility in a quantitative sequential database. The HUSPM approach appears to be well suited for resource transformation in DIKWP graphs. However, all the extensions of a high-utility sequential pattern (HUSP) also have a high utility that increases with its length. Therefore, it is difficult to obtain diverse patterns of resources. The patterns that consist of many low-utility items can also be a HUSP. In practice, such a long pattern is difficult to analyze. In addition, the low-utility items do not always reflect the interestingness of association rules. High average-utility pattern mining is considered a solution to extract more significant patterns by considering the lengths of patterns. In this paper, we formulate the problem of top-k high average-utility sequential pattern mining (HAUSPM) and propose a novel algorithm for resource transformation. We adopt a projection mechanism to improve efficiency. We also adopt the sequence average-utility-raising strategy to increase thresholds. We design the prefix extension average utility and the reduced sequence average utility by incorporating the average utility into the utility upper bounds. The results of our comparative experiments demonstrate that the proposed algorithm can achieve sufficiently good performance.
引用
收藏
页数:27
相关论文
共 67 条
[1]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[2]   A Novel Approach for Mining High-Utility Sequential Patterns in Sequence Databases [J].
Ahmed, Chowdhury Farhan ;
Tanbeer, Syed Khairuzzaman ;
Jeong, Byeong-Soo .
ETRI JOURNAL, 2010, 32 (05) :676-686
[3]   CRoM and HuspExt: Improving Efficiency of High Utility Sequential Pattern Extraction [J].
Alkan, Oznur Kirmemis ;
Karagoz, Pinar .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) :2645-2657
[4]  
AYRES J., 2002, P 8 ACM SIGKDD INT C, P429, DOI DOI 10.1145/775047.775109
[5]   An efficient algorithm for mining frequent sequences by a new strategy without support counting [J].
Chiu, DY ;
Wu, YH ;
Chen, ALP .
20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, :375-386
[6]  
Chunkai Zhang, 2019, 2019 IEEE 21st International Conference on High Performance Computing and Communications
[7]  
IEEE 17th International Conference on Smart City
[8]  
IEEE 5th International Conference on Data Science and Systems (HPCC/SmartCity/DSS). Proceedings, P2798, DOI 10.1109/HPCC/SmartCity/DSS.2019.00392
[9]  
Duan Y., 2023, P 22 INT C INT SOFTW, DOI 10.3233/FAIA230226
[10]  
Duan Y., 2023, DIKWP Artificial Consciousness Hypothesis, Nature and Principles (Empirical Description)