ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data

被引:6
作者
Davashi, Razieh [1 ,2 ]
机构
[1] Islamic Azad Univ, Fac Comp Engn, Najafabad Branch, Najafabad, Iran
[2] Islamic Azad Univ, Big Data Res Ctr, Najafabad Branch, Najafabad, Iran
关键词
Data mining; Frequent pattern mining; Uncertain frequent pattern; Uncertain data; Interactive mining; ITEMSETS; TREE; THRESHOLD; SUPPORT;
D O I
10.1016/j.eswa.2022.119156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Top-K Uncertain Frequent Pattern (UFP) mining is an interesting topic in data mining. The existing TUFP algorithm supports static mining of Top-K UFPs; however, in the real world, users need to repeatedly change the K threshold to extract the information according to the requirements of their application. In interactive environments, the TUFP algorithm needs to re-scan the database and create the UP-Lists and CUP-Lists from scratch which is very time-consuming. In this paper, a fast method called ITUFP is proposed for interactive mining of Top-K UFPs. The proposed method uses a new data structure called IMCUP-List to store information of patterns efficiently. It creates the UP-Lists with a single database scan, extracts the patterns by generating IMCUP-Lists, and stores all the lists. When K changes, the proposed algorithm only updates the IMCUP-Lists without having to create the lists from scratch. Accordingly, ITUFP conforms to the "build once, mine many" principle, where the UP-Lists and IMCUP-Lists are created only once and used in mining with different K values. This is the first study on interactive mining of Top-K UFPs. Extensive experimental results with sparse and dense uncertain data prove that the proposed method is very efficient for interactive mining of Top-K UFPs.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Mining top-k frequent patterns from uncertain databases
    Tuong Le
    Bay Vo
    Van-Nam Huynh
    Ngoc Thanh Nguyen
    Baik, Sung Wook
    APPLIED INTELLIGENCE, 2020, 50 (05) : 1487 - 1497
  • [2] Mining top-k frequent patterns from uncertain databases
    Tuong Le
    Bay Vo
    Van-Nam Huynh
    Ngoc Thanh Nguyen
    Sung Wook Baik
    Applied Intelligence, 2020, 50 : 1487 - 1497
  • [3] Interactive mining of top-K frequent closed itemsets from data streams
    Li, Hua-Fu
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (07) : 10779 - 10788
  • [4] Mining top-k frequent patterns over data streams sliding window
    Chen, Hui
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 111 - 131
  • [5] Mining top-k frequent patterns without minimum support threshold
    Salam, Abdus
    Khayal, M. Sikandar Hayat
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 30 (01) : 57 - 86
  • [6] Mining top-k frequent-regular closed patterns
    Amphawan, Komate
    Lenca, Philippe
    EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (21) : 7882 - 7894
  • [7] Mining top-k frequent patterns with combination reducing techniques
    Pyun, Gwangbum
    Yun, Unil
    APPLIED INTELLIGENCE, 2014, 41 (01) : 76 - 98
  • [8] Efficient Top-k Frequent Itemset Mining on Massive Data
    Wan, Xiaolong
    Han, Xixian
    DATA SCIENCE AND ENGINEERING, 2024, 9 (02) : 177 - 203
  • [9] Efficient prime-based method for interactive mining of frequent patterns
    Nadimi-Shahraki, Mohammad H.
    Mustapha, Norwati
    Sulaiman, Md. Nasir
    Mamat, Ali
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (10) : 12654 - 12670
  • [10] A Distributed Method for Fast Mining Frequent Patterns From Big Data
    Huang, Peng-Yu
    Cheng, Wan-Shu
    Chen, Ju-Chin
    Chung, Wen-Yu
    Chen, Young-Lin
    Lin, Kawuu W.
    IEEE ACCESS, 2021, 9 : 135144 - 135159