Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data

被引:2
|
作者
Cuzzocrea, Alfredo [1 ,2 ]
Leung, Carson K. [3 ]
MacKinnon, Richard Kyle [3 ]
机构
[1] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, TS, Italy
[2] ICAR CNR, I-34127 Trieste, TS, Italy
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015 | 2015年 / 60卷
关键词
Knowledge discovery and data mining; expected support; frequent patterns; uncertain data; upper bounds;
D O I
10.1016/j.procs.2015.08.195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge discovery and data mining generally discovers implicit, previously unknown, and useful knowledge from data. As one of the popular knowledge discovery and data mining tasks, frequent itemset mining, in particular, discovers knowledge in the form of sets of frequently co-occurring items, events, or objects. On the one hand, in many real-life applications, users mine frequent patterns from traditional databases of precise data, in which users know certainly the presence of items in transactions. On the other hand, in many other real-life applications, users mine frequent itemsets from probabilistic sets of uncertain data, in which users are uncertain about the likelihood of the presence of items in transactions. Each item in these probabilistic sets of uncertain data is often associated with an existential probability expressing the likelihood of its presence in that transaction. To mine frequent itemsets from these probabilistic datasets, many existing algorithms capture lots of information to compute expected support. To reduce the amount of space required, algorithms capture some but not all information in computing or approximating expected support. The tradeoff is that the upper bounds to expected support may not be tight. In this paper, we examine several upper bounds and recommend to the user which ones consume less space while providing good approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 35 条
  • [21] Mining Weighted Frequent Patterns from Uncertain Data Streams
    Ovi, Jesan Ahammed
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 917 - 936
  • [22] uCFS2: An Enhanced System that Mines Uncertain Data for Constrained Frequent Sets
    Leung, Carson Kai-Sang
    Brajczuk, Dale A.
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '10), 2010, : 32 - 37
  • [23] Knowledge discovery of customer purchasing intentions by plausible-frequent itemsets from uncertain data
    Weng, Cheng-Hsiung
    Huang, Tony Cheng-Kui
    APPLIED INTELLIGENCE, 2015, 43 (03) : 598 - 613
  • [24] Knowledge discovery of customer purchasing intentions by plausible-frequent itemsets from uncertain data
    Cheng-Hsiung Weng
    Tony Cheng-Kui Huang
    Applied Intelligence, 2015, 43 : 598 - 613
  • [25] Hyper-structure mining of frequent patterns in uncertain data streams
    HewaNadungodage, Chandima
    Xia, Yuni
    Lee, Jaehwan John
    Tu, Yi-cheng
    KNOWLEDGE AND INFORMATION SYSTEMS, 2013, 37 (01) : 219 - 244
  • [26] An Efficient Approach for Mining Frequent Patterns over Uncertain Data Streams
    Shajib, Md. Badi-Uz-Zaman
    Samiullah, Md.
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 980 - 984
  • [27] Hyper-structure mining of frequent patterns in uncertain data streams
    Chandima HewaNadungodage
    Yuni Xia
    Jaehwan John Lee
    Yi-cheng Tu
    Knowledge and Information Systems, 2013, 37 : 219 - 244
  • [28] Privacy-Preserving Frequent Pattern Mining from Big Uncertain Data
    Leung, Carson K.
    Hoi, Calvin S. H.
    Pazdor, Adam G. M.
    Wodi, Bryan H.
    Cuzzocrea, Alfredo
    2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5101 - 5110
  • [29] Frequent pattern mining algorithm for uncertain data streams based on sliding window
    Yang, Junrui
    Yang, Cai
    Wei, Yanjun
    2016 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL. 2, 2016, : 265 - 268
  • [30] Item-centric mining of frequent patterns from big uncertain data
    Braun, Peter
    Cuzzocrea, Alfredo
    Leung, Carson K.
    Pazdor, Adam G. M.
    Souza, Joglas
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 1875 - 1884