Fast Algorithms for Frequent Itemset Mining from Uncertain Data

被引:35
作者
Leung, Carson Kai-Sang [1 ]
MacKinnon, Richard Kyle [1 ]
Tanbeer, Syed K. [1 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB, Canada
来源
2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM) | 2014年
关键词
Association analysis; data mining algorithms; expected support; frequent patterns; tree structures; uncertain data;
D O I
10.1109/ICDM.2014.146
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The majority of existing data mining algorithms mine frequent itemsets from precise data. A well-known algorithm is FP-growth, which builds a compact FP-tree structure to capture important contents of precise data and mines frequent itemsets from the FP-tree. However, there are situations in which data are uncertain. To capture important contents (e.g., existential probabilities) of uncertain data for mining frequent itemsets, the UF-growth algorithm uses a UF-tree structure. However, the UF-tree can be large. Other tree structures for handling uncertain data may achieve compactness at the expense of looser upper bounds on expected supports. To solve this problem, we propose fast algorithms that use compact tree structures for capturing uncertain data with tightened upper bounds to expected support (tube) for frequent itemset mining from uncertain data. Experimental results show the tightness of tube provided by our algorithms and the compactness of our tree structures.
引用
收藏
页码:893 / 898
页数:6
相关论文
共 50 条
  • [1] Efficient Probabilistic Frequent Itemset Mining in Big Sparse Uncertain Data
    Xu, Jing
    Li, Ning
    Mao, Xiao-Jiao
    Yang, Yu-Bin
    PRICAI 2014: TRENDS IN ARTIFICIAL INTELLIGENCE, 2014, 8862 : 235 - 247
  • [2] Frequent Itemset Mining for a Combination of Certain and Uncertain Databases
    Wazir, Samar
    Ahmad, Tanvir
    Beg, M. M. Sufyan
    RECENT DEVELOPMENTS AND THE NEW DIRECTION IN SOFT-COMPUTING FOUNDATIONS AND APPLICATIONS, 2018, 361 : 25 - 39
  • [3] An Improved Vertical Algorithm for Frequent Itemset Mining from Uncertain Database
    Yang, Junrui
    Zhang, Yingjie
    Wei, Yanjun
    2017 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC 2017), VOL 1, 2017, : 355 - 358
  • [4] A Review of Frequent Pattern Mining Algorithms for Uncertain Data
    Bhogadhi, Vani
    Chandak, M. B.
    PROCEEDINGS OF SAI INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS) 2016, VOL 2, 2018, 16 : 974 - 983
  • [5] Review of Algorithm for Mining Frequent Patterns from Uncertain Data
    Yue, Liwen
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2015, 15 (06): : 17 - 21
  • [6] Weighted Frequent Itemset Mining Using OWA on Uncertain Transactional Database
    Wazir, Samar
    Beg, M. M. Sufyan
    Ahmad, Tanvir
    DATA COMMUNICATION AND NETWORKS, GUCON 2019, 2020, 1049 : 183 - 193
  • [7] DISC: Efficient Uncertain Frequent Pattern Mining with Tightened Upper Bounds
    MacKinnon, Richard Kyle
    Strauss, Teagan D.
    Leung, Carson Kai-Sang
    2014 IEEE International Conference on Data Mining Workshop (ICDMW), 2014, : 1038 - 1045
  • [8] Finding efficiencies in frequent pattern mining from big uncertain data
    Carson Kai-Sang Leung
    Richard Kyle MacKinnon
    Fan Jiang
    World Wide Web, 2017, 20 : 571 - 594
  • [9] Finding efficiencies in frequent pattern mining from big uncertain data
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    Jiang, Fan
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2017, 20 (03): : 571 - 594
  • [10] Mining Weighted Frequent Patterns from Uncertain Data Streams
    Ovi, Jesan Ahammed
    Ahmed, Chowdhury Farhan
    Leung, Carson K.
    Pazdor, Adam G. M.
    PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM) 2019, 2019, 935 : 917 - 936