ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data

被引:6
作者
Davashi, Razieh [1 ,2 ]
机构
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
关键词
Data mining; Frequent pattern mining; Uncertain frequent pattern; Uncertain data; Interactive mining; ITEMSETS; TREE; THRESHOLD; SUPPORT;
D O I
10.1016/j.eswa.2022.119156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-K Uncertain Frequent Pattern (UFP) mining is an interesting topic in data mining. The existing TUFP algorithm supports static mining of Top-K UFPs; however, in the real world, users need to repeatedly change the K threshold to extract the information according to the requirements of their application. In interactive environments, the TUFP algorithm needs to re-scan the database and create the UP-Lists and CUP-Lists from scratch which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store information of patterns efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists without having to create the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle, where the UP-Lists and IMCUP-Lists are created only once and used in mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experimental results with sparse and dense uncertain data prove that the proposed method is very efficient for interactive mining of Top-K UFPs.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Vertical Frequent Pattern Mining from Uncertain Data
    Budhia, Bhavek P.
    Cuzzocrea, Alfredo
    Leung, Carson K.
    ADVANCES IN KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, 2012, 243 : 1273 - 1282
  • [42] Skopus: Mining top-k sequential patterns under leverage
    Petitjean, Francois
    Li, Tao
    Tatti, Nikolaj
    Webb, Geoffrey I.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (05) : 1086 - 1111
  • [43] Skopus: Mining top-k sequential patterns under leverage
    François Petitjean
    Tao Li
    Nikolaj Tatti
    Geoffrey I. Webb
    Data Mining and Knowledge Discovery, 2016, 30 : 1086 - 1111
  • [44] Mining top-k approximate closed patterns in an imprecise database
    Yu, Xiaomei
    Wang, Hong
    Zheng, Xiangwei
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2018, 9 (02) : 97 - 107
  • [45] Hyper-structure mining of frequent patterns in uncertain data streams
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, Jaehwan John
    Tu, Yi-cheng
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 219 - 244
  • [46] An Efficient Optimization Approach for Top-k Queries on Uncertain Data
    Zhang, Zhiqiang
    Wei, Xiaoyan
    Xie, Xiaoqin
    Pan, Haiwei
    Miao, Yu
    INTERNATIONAL JOURNAL OF COOPERATIVE INFORMATION SYSTEMS, 2018, 27 (01)
  • [47] TKFIM: Top-K frequent itemset mining technique based on equivalence classes
    Iqbal, Saood
    Shahid, Abdul
    Roman, Muhammad
    Khan, Zahid
    Al-Otaibi, Shaha
    Yu, Lisu
    PEERJ COMPUTER SCIENCE, 2021, : 1 - 27
  • [48] Mining top-k frequent closed itemsets over data streams using the sliding window model
    Tsai, Pauray S. M.
    EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) : 6968 - 6973
  • [49] Fast mining Top-Rank-k frequent patterns by using Node-lists
    Deng, Zhi-Hong
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1763 - 1768
  • [50] One Database Pass Algorithms of Mining Top-k Frequent Closed Itemsets
    Qiu, Yong
    Lan, Yong-Jie
    ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, : 828 - 833