Comprehensive mining of frequent itemsets for a combination of certain and uncertain databases

被引:0
|
作者
Wazir S. [1 ]
Beg M.M.S. [2 ]
Ahmad T. [1 ]
机构
[1] Department of Computer Engineering, Jamia Millia Islamia, New Delhi
[2] Department of Computer Engineering, Aligarh Muslim University, Aligarh
关键词
Approximate Frequent Items; Certain and Uncertain Transactional Database; Expected Support; Frequent Itemset Mining; Normal Distribution; Poisson Distribution;
D O I
10.1007/s41870-019-00310-0
中图分类号
学科分类号
摘要
The mechanism of Frequent Itemset Mining can be performed by using sequential algorithms like Apriori on a standalone system, or it can be applied using parallel algorithms like Count Distribution on a distributed system. Due to communication overhead in parallel algorithms and exponential candidate generation, many algorithms were developed for calculating frequent items either over the certain or uncertain database. Yet not a single algorithm is developed so far which can cover the requirement of generating frequent itemset by combining both the databases. We had proposed earlier MasterApriori algorithm which is used to calculate Approximate Frequent Items for a combination of certain and uncertain databases with the support of Apriori for Certain and Expected support based UApriori for the uncertain database. In this paper, the researcher would like to extend the former work by using Poisson and Normal Distribution based UApriori for the uncertain database. In proposed algorithms, there is only one-time communication between sites where data is distributed, which reduce the communication overhead. Scalability and efficiency of proposed algorithms are then checked by using standard, and synthetic databases. The performances were then measured by comparing time taken and a number of frequent items generated by each algorithm. © 2019, Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:1205 / 1216
页数:11
相关论文
共 50 条
  • [21] Mining Distributed Frequent Itemsets Using a Gossip Based Protocol
    Bagheri, Maryam
    Mirian-Hosseinabadi, Seyed-Hassan
    Mashayekhi, Hoda
    Habibi, Jafar
    2012 9TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INTELLIGENCE & COMPUTING AND 9TH INTERNATIONAL CONFERENCE ON AUTONOMIC & TRUSTED COMPUTING (UIC/ATC), 2012, : 780 - 785
  • [22] Mining Approximate Frequent Itemsets Using Pattern Growth Approach
    Bashir, Shariq
    Lai, Daphne Teck Ching
    INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (04): : 627 - 644
  • [23] MapDiff-FI : Map Different Sets for Frequent Itemsets Mining
    Khongtuk, Thaweesak
    Boonbrahm, Salin
    Jaruskulchai, Chuleerat
    PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON APPLIED SCIENCE AND TECHNOLOGY (ICAST'18), 2018, 2016
  • [24] Parallel frequent itemsets mining using distributed graphic processing units
    Ali Abbas Zoraghchian
    Mohammad Karim Sohrabi
    Farzin Yaghmaee
    Multimedia Tools and Applications, 2022, 81 : 43873 - 43895
  • [25] PPS: Parallel Pincer Search for Mining Frequent Itemsets Based on Spark
    Sethi, Krishan Kumar
    Dharavath, Ramesh
    Nyakotey, Samuel
    PROCEEDINGS OF THE EIGHTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR 2016), 2018, 614 : 351 - 363
  • [26] An efficient pattern growth approach for mining fault tolerant frequent itemsets
    Bashir, Shariq
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 143
  • [27] Probabilistic load balancing method for parallel mining of all frequent itemsets
    Kessl, Robert
    Tvrdik, Pavel
    PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 578 - +
  • [28] Accelerating Frequent Itemsets Mining on the Cloud: A Map Reduce -Based Approach
    Farzanyar, Zahra
    Cercone, Nick
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 592 - 598
  • [29] Mining top-K frequent itemsets from data streams
    Raymond Chi-Wing Wong
    Ada Wai-Chee Fu
    Data Mining and Knowledge Discovery, 2006, 13 : 193 - 217
  • [30] Analyzing Expected Support-based Frequent Itemsets over Uncertain Data
    Chen, Fengjuan
    Qu, Wenyu
    Li, Zhiyang
    IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1721 - 1725