Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data

被引:2
|
作者
Cuzzocrea, Alfredo [1 ,2 ]
Leung, Carson K. [3 ]
MacKinnon, Richard Kyle [3 ]
机构
[1] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, TS, Italy
[2] ICAR CNR, I-34127 Trieste, TS, Italy
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015 | 2015年 / 60卷
关键词
Knowledge discovery and data mining; expected support; frequent patterns; uncertain data; upper bounds;
D O I
10.1016/j.procs.2015.08.195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge discovery and data mining generally discovers implicit, previously unknown, and useful knowledge from data. As one of the popular knowledge discovery and data mining tasks, frequent itemset mining, in particular, discovers knowledge in the form of sets of frequently co-occurring items, events, or objects. On the one hand, in many real-life applications, users mine frequent patterns from traditional databases of precise data, in which users know certainly the presence of items in transactions. On the other hand, in many other real-life applications, users mine frequent itemsets from probabilistic sets of uncertain data, in which users are uncertain about the likelihood of the presence of items in transactions. Each item in these probabilistic sets of uncertain data is often associated with an existential probability expressing the likelihood of its presence in that transaction. To mine frequent itemsets from these probabilistic datasets, many existing algorithms capture lots of information to compute expected support. To reduce the amount of space required, algorithms capture some but not all information in computing or approximating expected support. The tradeoff is that the upper bounds to expected support may not be tight. In this paper, we examine several upper bounds and recommend to the user which ones consume less space while providing good approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 35 条
  • [31] Fuzzy Association Rule Mining based Frequent Pattern Extraction from Uncertain Data
    Rajput, D. S.
    Thakur, R. S.
    Thakur, G. S.
    PROCEEDINGS OF THE 2012 WORLD CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGIES, 2012, : 709 - 714
  • [32] Mining Top-k Frequent-regular Itemsets from Data Streams Based on Sliding Window Technique
    Mesama, Tashinee
    Amphawan, Komate
    2018 5TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATICS: CONCEPTS, THEORY AND APPLICATIONS (ICAICTA 2018), 2018, : 224 - 230
  • [33] A Landmark-Model Based System for Mining Frequent Patterns from Uncertain Data Streams
    Leung, Carson Kai-Sang
    Jiang, Fan
    Hayduk, Yaroslav
    PROCEEDINGS OF THE 15TH INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM (IDEAS '11), 2011, : 249 - 250
  • [34] ITUFP: A fast method for interactive mining of Top-K frequent patterns from uncertain data
    Davashi, Razieh
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 214
  • [35] UP-tree & UP-Mine: A fast method based on upper bound for frequent pattern mining from uncertain data
    Davashi, Razieh
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2021, 106