Approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data

被引:2
|
作者
Cuzzocrea, Alfredo [1 ,2 ]
Leung, Carson K. [3 ]
MacKinnon, Richard Kyle [3 ]
机构
[1] Univ Trieste, Dept Engn & Architecture DIA, I-34127 Trieste, TS, Italy
[2] ICAR CNR, I-34127 Trieste, TS, Italy
[3] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
来源
KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 19TH ANNUAL CONFERENCE, KES-2015 | 2015年 / 60卷
关键词
Knowledge discovery and data mining; expected support; frequent patterns; uncertain data; upper bounds;
D O I
10.1016/j.procs.2015.08.195
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Knowledge discovery and data mining generally discovers implicit, previously unknown, and useful knowledge from data. As one of the popular knowledge discovery and data mining tasks, frequent itemset mining, in particular, discovers knowledge in the form of sets of frequently co-occurring items, events, or objects. On the one hand, in many real-life applications, users mine frequent patterns from traditional databases of precise data, in which users know certainly the presence of items in transactions. On the other hand, in many other real-life applications, users mine frequent itemsets from probabilistic sets of uncertain data, in which users are uncertain about the likelihood of the presence of items in transactions. Each item in these probabilistic sets of uncertain data is often associated with an existential probability expressing the likelihood of its presence in that transaction. To mine frequent itemsets from these probabilistic datasets, many existing algorithms capture lots of information to compute expected support. To reduce the amount of space required, algorithms capture some but not all information in computing or approximating expected support. The tradeoff is that the upper bounds to expected support may not be tight. In this paper, we examine several upper bounds and recommend to the user which ones consume less space while providing good approximation to expected support of frequent itemsets in mining probabilistic sets of uncertain data. (C) 2015 The Authors. Published by Elsevier B.V.
引用
收藏
页码:613 / 622
页数:10
相关论文
共 35 条
  • [1] Analyzing Expected Support-based Frequent Itemsets over Uncertain Data
    Chen, Fengjuan
    Qu, Wenyu
    Li, Zhiyang
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1721 - 1725
  • [2] Skip Search Approach for Mining Probabilistic Frequent Itemsets from Uncertain Data
    Shintani, Takahiko
    Ohmori, Tadashi
    Fujita, Hideyuki
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 174 - 180
  • [3] Uncertain Frequent Itemsets Mining Algorithm on Data Streams with Constraints
    Yu, Qun
    Tang, Ke-Ming
    Tang, Shi-Xi
    Lv, Xin
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 192 - 201
  • [4] Efficient mining algorithm of frequent itemsets for uncertain data streams
    Wang Qianqian
    Liu Fang-ai
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 443 - 446
  • [5] Tightening upper bounds to the expected support for uncertain frequent pattern mining
    Leung, Carson K.
    MacKinnon, Richard Kyle
    Tanbeer, Syed K.
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS 18TH ANNUAL CONFERENCE, KES-2014, 2014, 35 : 328 - 337
  • [6] Mining Frequent Itemsets in Correlated Uncertain Databases
    Yong-Xin Tong
    Lei Chen
    Jieying She
    Journal of Computer Science and Technology, 2015, 30 : 696 - 712
  • [7] Mining Frequent Itemsets in Correlated Uncertain Databases
    Tong, Yong-Xin
    Chen, Lei
    She, Jieying
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (04) : 696 - 712
  • [8] Comprehensive mining of frequent itemsets for a combination of certain and uncertain databases
    Wazir S.
    Beg M.M.S.
    Ahmad T.
    International Journal of Information Technology, 2020, 12 (4) : 1205 - 1216
  • [9] Efficient Probabilistic Frequent Itemset Mining in Big Sparse Uncertain Data
    Xu, Jing
    Li, Ning
    Mao, Xiao-Jiao
    Yang, Yu-Bin
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 235 - 247
  • [10] Discovering probabilistic frequent closed itemsets in uncertain database with tuple uncertainty
    Chen, Fengjuan
    Qu, Wenyu
    Nie, Lihai
    Wu, Junfeng
    Li, Yuanyuan
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2016, 31 (02): : 109 - 116