ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data

被引:6
作者
Davashi, Razieh [1 ,2 ]
机构
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
关键词
Data mining; Frequent pattern mining; Uncertain frequent pattern; Uncertain data; Interactive mining; ITEMSETS; TREE; THRESHOLD; SUPPORT;
D O I
10.1016/j.eswa.2022.119156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-K Uncertain Frequent Pattern (UFP) mining is an interesting topic in data mining. The existing TUFP algorithm supports static mining of Top-K UFPs; however, in the real world, users need to repeatedly change the K threshold to extract the information according to the requirements of their application. In interactive environments, the TUFP algorithm needs to re-scan the database and create the UP-Lists and CUP-Lists from scratch which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store information of patterns efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists without having to create the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle, where the UP-Lists and IMCUP-Lists are created only once and used in mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experimental results with sparse and dense uncertain data prove that the proposed method is very efficient for interactive mining of Top-K UFPs.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] An accurate method for mining top-k frequent pattern under differential privacy
    Zhang, Xiaojian
    Wang, Miao
    Meng, Xiaofeng
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2014, 51 (01): : 104 - 114
  • [22] Mining frequent and top-K High Utility Time Interval-based Events with Duration patterns
    Huang, Jen-Wei
    Jaysawal, Bijay Prasad
    Chen, Kuan-Ying
    Wu, Yong-Bin
    KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 61 (03) : 1331 - 1359
  • [23] Mining Top-k Frequent Patterns in Large Geosocial Networks: A Mnie-Based Extension Approach
    Zhou, Changben
    Xu, Jian
    Jiang, Ming
    Tang, Donghang
    Wang, Sheng
    IEEE ACCESS, 2023, 11 : 27662 - 27675
  • [24] Efficient algorithms of mining top-k frequent closed itemsets
    Lan Yongjie
    Qiu Yong
    ICEMI 2007: PROCEEDINGS OF 2007 8TH INTERNATIONAL CONFERENCE ON ELECTRONIC MEASUREMENT & INSTRUMENTS, VOL II, 2007, : 551 - 554
  • [25] An Efficient Approach for Mining Frequent Patterns over Uncertain Data Streams
    Shajib, Md. Badi-Uz-Zaman
    Samiullah, Md.
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 980 - 984
  • [26] Fast Algorithms for Frequent Itemset Mining from Uncertain Data
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    Tanbeer, Syed K.
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2014, : 893 - 898
  • [27] Mining maximal frequent patterns from univariate uncertain data
    Liu, Ying-Ho
    INTELLIGENT DATA ANALYSIS, 2014, 18 (04) : 653 - 676
  • [28] A dynamic data structure for top-k queries on uncertain data
    Chen, Jiang
    Yi, Ke
    THEORETICAL COMPUTER SCIENCE, 2008, 407 (1-3) : 310 - 317
  • [29] Optimizing Distributed Top-k Queries on Uncertain Data
    Zhao Zhibin
    Yu Yang
    Bao Yubin
    Yu Ge
    2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3209 - 3214
  • [30] On the semantics of top-k ranking for objects with uncertain data
    Wang, Chonghai
    Yuan, Li Yan
    You, Jia-Huai
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2011, 62 (07) : 2812 - 2823