An Efficient Method for Mining Top-K Closed Sequential Patterns

被引:10
作者
Pham, Thi-Thiet [1 ]
Do, Tung [2 ]
Nguyen, Anh [3 ]
Vo, Bay [4 ]
Hong, Tzung-Pei [5 ,6 ]
机构
[1] Ind Univ Ho Chi Minh City, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[2] Van Lang Univ, Fac Basic Sci, Ho Chi Minh City 700000, Vietnam
[3] Duy Tan Univ, Inst Res & Dev, Da Nang 550000, Vietnam
[4] Ho Chi Minh City Univ Technol HUTECH, Fac Informat Technol, Ho Chi Minh City 700000, Vietnam
[5] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung 811, Taiwan
[6] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung 804, Taiwan
关键词
Closed sequential pattern; data mining; sequential pattern; top-k sequential patterns; FREQUENT PATTERNS; ALGORITHMS;
D O I
10.1109/ACCESS.2020.3004528
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of exploiting Closed Sequential Patterns (CSPs) is an essential task in data mining, with many different applications. It is used to resolve the situations of huge databases or low minimum support (minsup) thresholds in mining sequential patterns. However, it is challenging and needs a lot of time to customize the minsup values for generating appropriate numbers of CSPs desired by users. To conquer this issue, the TSP algorithm for mining top-k CSPs was previously proposed, with k being a given parameter. The algorithm would return the k CSPs which have the highest support values in a database. However, its execution time and memory usage were high. In this paper, an algorithm named TKCS (Top-K Closed Sequences) is proposed to mine the top-k CSPs efficiently. To improve the execution time and memory usage, it uses a vertical bitmap database to represent data. Besides, it adopts some useful strategies in the process of exploiting the top-k CSPs such as: always choosing the sequential patterns with the greatest support values for generating candidate patterns and storing top-k CSPs in an ascending order of the support values to increase the minsup value more quickly. The empirical results show that TKCS has better performance than TSP for discovering the top-k CSPs in terms of both runtime and memory usage.
引用
收藏
页码:118156 / 118163
页数:8
相关论文
共 50 条
  • [41] Mining Cross-Level Closed Sequential Patterns
    Aman, Rutba
    Ahmed, Chowdhury Farhan
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 199 - 214
  • [42] Efficient Top-k Retrieval on Massive Data
    Han, Xixian
    Li, Jianzhong
    Gao, Hong
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (10) : 2687 - 2699
  • [43] Efficient top-k aggregation of ranked inputs
    Mamoulis, Nikos
    Yiu, Man Lung
    Cheng, Kit Hung
    Cheung, David W.
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2007, 32 (03):
  • [44] Efficient Top-k Closeness Centrality Search
    Olsen, Paul W., Jr.
    Labouseur, Alan G.
    Hwang, Jeong-Hyon
    2014 IEEE 30TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2014, : 196 - 207
  • [45] An Improved Algorithm for Mining Top-k Association Rules
    Nguyen, Linh T. T.
    Nguyen, Loan T. T.
    Bay Vo
    ADVANCED COMPUTATIONAL METHODS FOR KNOWLEDGE ENGINEERING, ICCSAMA 2017, 2018, 629 : 117 - 128
  • [46] A Fast Algorithm for Mining Top-Rank-k Erasable Closed Patterns
    Ham Nguyen
    Tuong Le
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3571 - 3583
  • [47] Mining top-k frequent closed itemsets over data streams using the sliding window model
    Tsai, Pauray S. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 6968 - 6973
  • [48] An efficient algorithm for mining top-rank-k frequent patterns
    Thu-Lan Dam
    Kenli Li
    Philippe Fournier-Viger
    Quang-Huy Duong
    Applied Intelligence, 2016, 45 : 96 - 111
  • [49] e-RNSP: An Efficient Method for Mining Repetition Negative Sequential Patterns
    Dong, Xiangjun
    Gong, Yongshun
    Cao, Longbing
    IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (05) : 2084 - 2096
  • [50] k-PFPMiner: Top-k Periodic Frequent Patterns in Big Temporal Databases
    Likhitha, Palla
    Ravikumar, Penugonda
    Saxena, Deepika
    Kiran, Rage Uday
    Watanobe, Yutaka
    IEEE ACCESS, 2023, 11 : 119033 - 119044