ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data

Cited by: 6
Author
Davashi, Razieh [1, 2]
Affiliations
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
Keywords
Data mining; Frequent pattern mining; Uncertain frequent pattern; Uncertain data; Interactive mining; ITEMSETS; TREE; THRESHOLD; SUPPORT
DOI
10.1016/j.eswa.2022.119156
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Top-K Uncertain Frequent Pattern (UFP) mining is an active topic in data mining. The existing TUFP algorithm supports only static mining of Top-K UFPs; in practice, however, users need to change the K threshold repeatedly to extract information that fits the requirements of their application. In interactive environments, the TUFP algorithm must re-scan the database and rebuild the UP-Lists and CUP-Lists from scratch each time, which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store pattern information efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists instead of creating the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle: the UP-Lists and IMCUP-Lists are created only once and reused when mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experiments on sparse and dense uncertain data show that the proposed method is very efficient for interactive mining of Top-K UFPs.
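The "build once, mine many" idea in the abstract can be illustrated with a small Python sketch. This is not the ITUFP algorithm and does not reproduce the UP-List or IMCUP-List structures, whose details are not given here. It assumes the commonly used expected-support measure (the sum over transactions of the product of the item probabilities), a toy database `DB`, and a hypothetical class `TopKMiner` that scans the data once, caches one probability list per pattern, and reuses those lists when K changes. Candidate patterns are enumerated by brute force, which is only workable for tiny inputs.

```python
from itertools import combinations
import heapq

# Toy uncertain database (an assumption for illustration):
# each transaction maps an item to its existence probability.
DB = [
    {"a": 0.9, "b": 0.5},
    {"a": 0.8, "c": 0.4},
    {"a": 0.5, "b": 1.0, "c": 0.5},
]


class TopKMiner:
    """Hypothetical helper: builds per-pattern lists once, answers many K."""

    def __init__(self, db):
        # Single database scan: pattern (as a tuple) -> {transaction id: probability}.
        self.lists = {}
        for tid, transaction in enumerate(db):
            for item, prob in transaction.items():
                self.lists.setdefault((item,), {})[tid] = prob
        self.items = sorted(pattern[0] for pattern in self.lists)

    def _list(self, pattern):
        # Build the list of a longer pattern by joining the list of its prefix
        # with the list of its last item, then cache it for later queries.
        if pattern not in self.lists:
            prefix = self._list(pattern[:-1])
            last = self.lists[(pattern[-1],)]
            self.lists[pattern] = {
                tid: prob * last[tid] for tid, prob in prefix.items() if tid in last
            }
        return self.lists[pattern]

    def exp_sup(self, pattern):
        # Expected support: sum of the per-transaction pattern probabilities.
        return sum(self._list(pattern).values())

    def top_k(self, k):
        # Brute-force candidate enumeration (toy only); cached lists are reused,
        # so a new K does not trigger another database scan.
        candidates = (
            c
            for size in range(1, len(self.items) + 1)
            for c in combinations(self.items, size)
        )
        scored = ((pattern, self.exp_sup(pattern)) for pattern in candidates)
        return heapq.nlargest(k, scored, key=lambda entry: entry[1])


miner = TopKMiner(DB)                        # lists are built here, once
print([p for p, _ in miner.top_k(2)])        # [('a',), ('b',)]
print([p for p, _ in miner.top_k(4)])        # same cached lists, larger K
```

The second call to `top_k` reads only the cached lists, which is the behaviour the abstract attributes to interactive mining; a real implementation would also prune candidates rather than enumerate every itemset.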
Pages: 15