Mining top-k frequent-regular closed patterns

被引:20
作者
Amphawan, Komate [1 ]
Lenca, Philippe [2 ]
机构
[1] Burapha Univ, Computat Innovat Lab, Informat, Bangkok, Thailand
[2] CNRS, Inst Mines Telecom, Telecom Bretagne, UMR 6285,LabSTICC, Bretagne, France
关键词
Frequent pattern; Regular pattern; Closed pattern; Bit-vector; ITEMSETS; BITTABLEFI; ALGORITHM;
D O I
10.1016/j.eswa.2015.06.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent-regular pattern mining has attracted recently many works. Most of the approaches focus on discovering a complete set of patterns under the user-given support and regularity threshold constraints. This leads to several quantitative and qualitative drawbacks. First, it is often difficult to set appropriate support threshold. Second, algorithms produce a huge number of patterns, many of them being redundant. Third, most of the patterns are of very small size and it is arduous to extract interesting relationship among items. To reduce the number of patterns a common solution is to consider the desired number k of outputs and to mine the top-k patterns. In addition, this approach does not require to set a support threshold. To cope with redundancy and interestingness relationship among items, we suggest to focus on closed patterns and introduce a minimal length constraint. We thus propose to mine the top-k frequent-regular closed patterns with minimal length. An efficient single-pass algorithm, called TFRC-Mine, and a new compact bit-vector representation which allows to prune uninteresting candidate, are designed. Experiments show that the proposed algorithm is efficient to produce longer - non redundant - patterns, and that the new data representation is efficient for both computational time and memory usage. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7882 / 7894
页数:13
相关论文
共 50 条
  • [31] TKIFRPM: A Novel Approach for Topmost-K Identical Frequent Regular Patterns Mining from Incremental Datasets
    Rehman, Saif Ur
    Khan, Muhammad Altaf
    Nabi, Habib Un
    Ali, Shaukat
    Alnazzawi, Noha
    Khan, Shafiullah
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [32] An N-list-based algorithm for mining frequent closed patterns
    Tuong Le
    Bay Vo
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (19) : 6648 - 6657
  • [33] An Improving Approach for Top-Rank-k Frequent Pattern Mining
    Nguyen, Ham
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2022, 09 (04) : 417 - 433
  • [34] Discovering Top-k Profitable Patterns for Smart Manufacturing
    Wan, Shicheng
    Chen, Jiahui
    Zhang, Peifeng
    Gan, Wensheng
    Gu, Tianlong
    COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2022, WWW 2022 COMPANION, 2022, : 956 - 964
  • [35] Mining frequent closed inter-sequence patterns efficiently using dynamic bit vectors
    Bac Le
    Minh-Thai Tran
    Bay Vo
    APPLIED INTELLIGENCE, 2015, 43 (01) : 74 - 84
  • [36] Mining Top-k motifs with a SAT-based framework
    Jabbour, Said
    Sais, Lakhdar
    Salhi, Yakoub
    ARTIFICIAL INTELLIGENCE, 2017, 244 : 30 - 47
  • [37] Fast mining Top-Rank-k frequent patterns by using Node-lists
    Deng, Zhi-Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1763 - 1768
  • [38] An Efficient Algorithm For Mining Top-Rank-K Frequent Patterns From Uncertain Databases
    Goyal, Neha
    Jain, S. K.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON APPLIED AND THEORETICAL COMPUTING AND COMMUNICATION TECHNOLOGY (ICATCCT), 2016, : 324 - 328
  • [39] Mining the dominant patterns of customer shifts between segments by using top-k and distinguishing sequential rules
    Akhondzadeh-Noughabi, Elham
    Albadvi, Amir
    MANAGEMENT DECISION, 2015, 53 (09) : 1976 - 2003
  • [40] PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns
    Da Yan
    Wenwen Qu
    Guimu Guo
    Xiaoling Wang
    Yang Zhou
    The VLDB Journal, 2022, 31 : 253 - 286