Mining top-k frequent-regular closed patterns

被引:20
作者
Amphawan, Komate [1 ]
Lenca, Philippe [2 ]
机构
[1] Burapha Univ, Computat Innovat Lab, Informat, Bangkok, Thailand
[2] CNRS, Inst Mines Telecom, Telecom Bretagne, UMR 6285,LabSTICC, Bretagne, France
关键词
Frequent pattern; Regular pattern; Closed pattern; Bit-vector; ITEMSETS; BITTABLEFI; ALGORITHM;
D O I
10.1016/j.eswa.2015.06.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent-regular pattern mining has attracted recently many works. Most of the approaches focus on discovering a complete set of patterns under the user-given support and regularity threshold constraints. This leads to several quantitative and qualitative drawbacks. First, it is often difficult to set appropriate support threshold. Second, algorithms produce a huge number of patterns, many of them being redundant. Third, most of the patterns are of very small size and it is arduous to extract interesting relationship among items. To reduce the number of patterns a common solution is to consider the desired number k of outputs and to mine the top-k patterns. In addition, this approach does not require to set a support threshold. To cope with redundancy and interestingness relationship among items, we suggest to focus on closed patterns and introduce a minimal length constraint. We thus propose to mine the top-k frequent-regular closed patterns with minimal length. An efficient single-pass algorithm, called TFRC-Mine, and a new compact bit-vector representation which allows to prune uninteresting candidate, are designed. Experiments show that the proposed algorithm is efficient to produce longer - non redundant - patterns, and that the new data representation is efficient for both computational time and memory usage. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:7882 / 7894
页数:13
相关论文
共 50 条
  • [21] Efficient Approach for Mining Top-k Strong Patterns in Social Network Service
    Bakariya, Brijesh
    Chaturvedi, Kapil
    Singh, Krishna Pratap
    Thakur, G. S.
    2016 FIFTH INTERNATIONAL CONFERENCE ON ECO-FRIENDLY COMPUTING AND COMMUNICATION SYSTEMS (ICECCS), 2016, : 104 - 108
  • [22] PrefixFPM: a parallel framework for general-purpose mining of frequent and closed patterns
    Yan, Da
    Qu, Wenwen
    Guo, Guimu
    Wang, Xiaoling
    Zhou, Yang
    VLDB JOURNAL, 2022, 31 (02) : 253 - 286
  • [23] Mining Top-K Periodic-Frequent Pattern from Transactional Databases without Support Threshold
    Amphawan, Komate
    Lenca, Philippe
    Surarerks, Athasit
    ADVANCES IN INFORMATION TECHNOLOGY, PROCEEDINGS, 2009, 55 : 18 - +
  • [24] Mining Top-k High Average-Utility Sequential Patterns for Resource Transformation
    Cao, Kai
    Duan, Yucong
    APPLIED SCIENCES-BASEL, 2023, 13 (22):
  • [25] Finding Top-k Fuzzy Frequent Itemsets from Databases
    Li, Haifeng
    Wang, Yue
    Zhang, Ning
    Zhang, Yuejin
    DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 22 - 30
  • [26] Efficient Index for Retrieving Top-k Most Frequent Documents
    Hon, Wing-Kai
    Shah, Rahul
    Wu, Shih-Bin
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5721 : 182 - +
  • [27] TopUMS: Top-k Utility Mining in Stream Data
    Song, Wei
    Fang, Caiyu
    Gan, Wensheng
    21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 615 - 622
  • [28] A Fast Algorithm for Mining Top-Rank-k Erasable Closed Patterns
    Ham Nguyen
    Tuong Le
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (02): : 3571 - 3583
  • [29] Mining Regular Closed Patterns in Transactional Databases
    Sreedevi, M.
    Reddy, L. S. S.
    7TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2013), 2013, : 380 - 383
  • [30] Mining Top-K Path Traversal Patterns over Streaming Web Click-Sequences
    Li, Hua-Fu
    Lee, Suh-Yin
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2009, 25 (04) : 1121 - 1133