Efficient algorithms for mining top-rank-k erasable patterns using pruning strategies and the subsume concept

Cited by: 29
Authors
Tuong Le [1]
Bay Vo [2, 3]
Baik, Sung Wook [1]
Affiliations
[1] Sejong Univ, Digital Contents Res Inst, Seoul, South Korea
[2] Ton Duc Thang Univ, Div Data Sci, Ho Chi Minh City, Vietnam
[3] Ton Duc Thang Univ, Fac Informat Technol, Ho Chi Minh City, Vietnam
Keywords
Data mining; Erasable patterns; Pattern mining; Top-rank-k; HIGH UTILITY ITEMSETS; FREQUENT ITEMSETS; LISTS
DOI
10.1016/j.engappai.2017.09.010
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Mining erasable patterns (EPs) is an emerging data mining task that helps factory managers plan the development of new product systems. In practice, however, the number of EPs is often very large. To address this, the problem of mining top-rank-k EPs was proposed in 2013, together with VM, an algorithm that mines them using the PID_List structure. In this paper, we propose two efficient algorithms for mining top-rank-k EPs: TEP (Top-rank-k Erasable Pattern mining) and TEPUS (Top-rank-k Erasable Pattern mining Using the Subsume concept). TEP uses the dPidset structure to reduce memory usage and a dynamic threshold pruning strategy to accelerate mining. TEPUS extends TEP with the subsume concept and an index strategy to further reduce mining time and memory usage. Finally, we experimentally compare the mining time, memory usage and scalability of TEP, TEPUS and two state-of-the-art algorithms (VM and dVM) for mining top-rank-k EPs. The performance studies show that TEPUS outperforms TEP, VM and dVM. (C) 2017 Elsevier Ltd. All rights reserved.
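The dynamic threshold idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' TEP or TEPUS algorithm: it uses no dPidset, subsume, or index structures, and it recomputes gains naively. It assumes the usual formulation from the erasable-pattern literature, which the abstract itself does not spell out: each product has a set of component items and a profit, the gain of a pattern is the total profit of the products that use at least one of its items, and a pattern is top-rank-k if its gain is among the k smallest distinct gain values. The names `gain`, `top_rank_k_erasable`, and `sample_products` are hypothetical.

```python
def gain(pattern, products):
    """Total profit of the products that use at least one item of `pattern`."""
    return sum(profit for comps, profit in products
               if any(item in comps for item in pattern))


def top_rank_k_erasable(products, k):
    """Level-wise search with a dynamic gain threshold (illustrative only).

    Gain never decreases when an item is added to a pattern, so a pattern
    whose gain exceeds the current k-th smallest distinct gain cannot be
    extended into a top-rank-k pattern and is pruned.
    """
    items = sorted({i for comps, _ in products for i in comps})
    best = {}  # gain value -> patterns; holds at most k distinct gain values

    def threshold():
        # Unbounded until k distinct gains are known; afterwards it only tightens.
        return max(best) if len(best) >= k else float("inf")

    def offer(pattern, g):
        if g > threshold():
            return
        best.setdefault(g, []).append(pattern)
        if len(best) > k:
            del best[max(best)]  # evict the worst rank, tightening the threshold

    level = [(item,) for item in items]
    while level:
        scored = [(p, gain(p, products)) for p in level]
        for p, g in scored:
            offer(p, g)
        limit = threshold()
        # Extend only the surviving patterns, adding items in sorted order.
        level = [p + (i,) for p, g in scored if g <= limit
                 for i in items if i > p[-1]]
    return [(g, sorted(best[g])) for g in sorted(best)]


# Made-up example data: (component items, profit) per product.
sample_products = [
    ({"a", "b"}, 10),
    ({"b", "c"}, 20),
    ({"a", "d"}, 30),
    ({"c"}, 5),
]
result = top_rank_k_erasable(sample_products, 2)
```

With this made-up data and k = 2, the search stops after the second level because every two-item pattern already exceeds the threshold set by the single items.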
Pages: 1-9
Page count: 9