TK-RNSP: Efficient Top-K Repetitive Negative Sequential Pattern mining

被引:0
|
作者
Lan, Dun [1 ,2 ]
Sun, Chuanhou [1 ,2 ]
Dong, Xiangjun [1 ,2 ]
Qiu, Ping [3 ]
Gong, Yongshun [4 ]
Liu, Xinwang [5 ]
Fournier-Viger, Philippe [6 ]
Zhang, Chengqi [7 ]
机构
[1] Qilu Univ Technol, Shandong Comp Ctr, Key Lab Comp Power Network & Informat Secur, Natl Supercomp Ctr Jinan,Shandong Acad Sci,Minist, Jinan 250353, Peoples R China
[2] Shandong Fundamental Res Ctr Comp Sci, Shandong Prov Key Lab Comp Power Internet & Serv C, Jinan 250101, Peoples R China
[3] Nanjing Univ Posts & Telecommun, Sch Internet Things, Nanjing 210023, Peoples R China
[4] Shandong Univ, Sch Software, Jinan 250100, Peoples R China
[5] Natl Univ Def Technol, Sch Comp, Changsha 410003, Hunan, Peoples R China
[6] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[7] Hong Kong Polytech Univ, Dept Data Sci & Artificial Intelligence, Hong Kong 100872, Peoples R China
基金
中国国家自然科学基金;
关键词
Sequential pattern mining; Negative sequential pattern; Top-K repetitive negative sequential patterns; Nonoverlapping; SEQUENCES; FREQUENT;
D O I
10.1016/j.ipm.2025.104077
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Repetitive Negative Sequential Patterns (RNSPs) can provide critical insights into the importance of sequences. However, most current RNSP mining methods require users to set an appropriate support threshold to obtain the expected number of patterns, which is a very difficult task for the users without prior experience. To address this issue, we propose a new algorithm, TK-RNSP, to mine the Top-K RNSPs with the highest support, without the need to set a support threshold. In detail, we achieve a significant breakthrough by proposing a series of definitions that enable RNSP mining to satisfy anti-monotonicity. Then, we propose a bitmapbased Depth-First Backtracking Search (DFBS) strategy to decrease the heavy computational burden by increasing the speed of support calculation. Finally, we propose the algorithm TKRNSP in an one-stage process, which can effectively reduce the generation of unnecessary patterns and improve computational efficiency comparing to those two-stage process algorithms. To the best of our knowledge, TK-RNSP is the first algorithm to mine Top-K RNSPs. Extensive experiments on eight datasets show that TK-RNSP has better flexibility and efficiency to mine Top-K RNSPs.
引用
收藏
页数:25
相关论文
共 28 条
  • [21] Cache-Efficient Top-k Aggregation over High Cardinality Large Datasets
    Siddiqui, Tarique
    Narasayya, Vivek
    Dumitru, Marius
    Chaudhuri, Surajit
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 17 (04): : 644 - 656
  • [22] CCSMP: an efficient closed contiguous sequential pattern mining algorithm with a pattern relation graph
    Hu, Haichuan
    Zhang, Jingwei
    Xia, Ruiqing
    Liu, Shichao
    APPLIED INTELLIGENCE, 2023, 53 (24) : 29723 - 29740
  • [23] CCSMP: an efficient closed contiguous sequential pattern mining algorithm with a pattern relation graph
    Haichuan Hu
    Jingwei Zhang
    Ruiqing Xia
    Shichao Liu
    Applied Intelligence, 2023, 53 : 29723 - 29740
  • [24] On Selection of Efficient Sequential Pattern Mining Algorithm Based on Characteristics of Data
    Peschel, Jakub
    Batko, Michal
    Zezula, Pavel
    2022 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM), 2022, : 202 - 205
  • [25] An Efficient GA-Based Algorithm for Mining Negative Sequential Patterns
    Zheng, Zhigang
    Zhao, Yanchang
    Zuo, Ziye
    Cao, Longbing
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT I, PROCEEDINGS, 2010, 6118 : 262 - 273
  • [26] Bubble Sketch: A High-performance and Memory-efficient Sketch for Finding Top-k Items in Data Streams
    Cao, Lu
    Shi, Qilong
    Liu, Yuxi
    Zheng, Hanyue
    Xin, Yao
    Li, Wenjun
    Yang, Tong
    Wang, Yangyang
    Xu, Yang
    Zhang, Weizhe
    Xu, Mingwei
    PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2024, 2024, : 3653 - 3657
  • [27] An Efficient Web Recommender System based on Approach of Mining Frequent Sequential Pattern from Customized Web Log Preprocessing
    Valera, Manisha
    Chauhan, Uttam
    2013 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATIONS AND NETWORKING TECHNOLOGIES (ICCCNT), 2013,
  • [28] Efficient constraint-based Sequential Pattern Mining (SPM) algorithm to understand customers' buying behaviour from time stamp-based sequence dataset
    Kumar, Niti Ashish
    Ganatra, Amit
    COGENT ENGINEERING, 2015, 2 (01):