Comprehensive mining of frequent itemsets for a combination of certain and uncertain databases

被引:0
|
作者
Wazir S. [1 ]
Beg M.M.S. [2 ]
Ahmad T. [1 ]
机构
[1] Department of Computer Engineering, Jamia Millia Islamia, New Delhi
[2] Department of Computer Engineering, Aligarh Muslim University, Aligarh
关键词
Approximate Frequent Items; Certain and Uncertain Transactional Database; Expected Support; Frequent Itemset Mining; Normal Distribution; Poisson Distribution;
D O I
10.1007/s41870-019-00310-0
中图分类号
学科分类号
摘要
The mechanism of Frequent Itemset Mining can be performed by using sequential algorithms like Apriori on a standalone system, or it can be applied using parallel algorithms like Count Distribution on a distributed system. Due to communication overhead in parallel algorithms and exponential candidate generation, many algorithms were developed for calculating frequent items either over the certain or uncertain database. Yet not a single algorithm is developed so far which can cover the requirement of generating frequent itemset by combining both the databases. We had proposed earlier MasterApriori algorithm which is used to calculate Approximate Frequent Items for a combination of certain and uncertain databases with the support of Apriori for Certain and Expected support based UApriori for the uncertain database. In this paper, the researcher would like to extend the former work by using Poisson and Normal Distribution based UApriori for the uncertain database. In proposed algorithms, there is only one-time communication between sites where data is distributed, which reduce the communication overhead. Scalability and efficiency of proposed algorithms are then checked by using standard, and synthetic databases. The performances were then measured by comparing time taken and a number of frequent items generated by each algorithm. © 2019, Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:1205 / 1216
页数:11
相关论文
共 50 条
  • [31] An integrated cuckoo search-genetic algorithm for mining frequent itemsets
    Sukanya, N. S.
    Thangaiah, P. Ranjit Jeba
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2022, 25 (03): : 671 - 690
  • [32] Parallel frequent itemsets mining using distributed graphic processing units
    Zoraghchian, Ali Abbas
    Sohrabi, Mohammad Karim
    Yaghmaee, Farzin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (30) : 43873 - 43895
  • [33] Efficient Mining of Maximal Frequent Itemsets Based on M-Step Lookahead
    Meyer, Elijah L.
    Chung, Soon M.
    PROCEEDINGS OF 2018 5TH INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2018,
  • [34] A New Rymon Tree Based Procedure for Mining Statistically Significant Frequent Itemsets
    Stanisic, P.
    Tomovic, S.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2010, 5 (04) : 567 - 577
  • [35] A multi-objective evolutionary approach for mining frequent and high utility itemsets
    Zhang, Lei
    Fu, Guanglong
    Cheng, Fan
    Qiu, Jianfeng
    Su, Yansen
    APPLIED SOFT COMPUTING, 2018, 62 : 974 - 986
  • [36] A Privacy Frequent Itemsets Mining Framework for Collaboration in IoT Using Federated Learning
    Wu, Jimmy Ming-Tai
    Teng, Qian
    Huda, Shamsul
    Chen, Yeh-Cheng
    Chen, Chien-Ming
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2023, 19 (02)
  • [37] Efficient Mining of Frequent itemsets in Social Network Data based on MapReduce Framework
    Farzanyar, Zahra
    Cercone, Nick
    2013 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2013, : 1183 - 1188
  • [38] A new algorithm for fast mining frequent itemsets using N-lists
    Deng ZhiHong
    Wang ZhongHui
    Jiang JiaJian
    SCIENCE CHINA-INFORMATION SCIENCES, 2012, 55 (09) : 2008 - 2030
  • [39] A new algorithm for fast mining frequent itemsets using N-lists
    ZhiHong Deng
    ZhongHui Wang
    JiaJian Jiang
    Science China Information Sciences, 2012, 55 : 2008 - 2030
  • [40] A new algorithm for fast mining frequent itemsets using N-lists
    DENG ZhiHong
    ScienceChina(InformationSciences), 2012, 55 (09) : 2008 - 2030