Distributed synthesized association mining for big transactional data

被引:4
|
作者
Pal, Amrit [1 ,2 ]
Kumar, Manish [2 ]
机构
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura, India
[2] Indian Inst Informat Technol Allahabad, Dept Informat Technol, Prayagraj, India
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2020年 / 45卷 / 01期
关键词
Big Data; HDFS; MapReduce; Apriori; frequent itemset; association rule; DATA SETS; RULES; PATTERNS;
D O I
10.1007/s12046-020-01380-8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Data is increasing rapidly day by day along with the transactional database. Dividing this data and storing it in a distributed manner is an effective way for storage and retrieval. Mining such distributed data with minimum dependence between sub-problems is a crucial task. Finding frequent itemsets and corresponding association rules is a big challenge while considering the aggregation in a distributed environment. To overcome these challenges, we propose a distributed frequent itemset generation and association rule mining algorithm using MapReduce programming model. The proposed scheme generates frequent itemset and mine association rules using a synthesized distributed technique. The rules are mined in a distributed manner, and then weights are assigned to subsets of data and association rules. A proper mixture of association rules that are generated in distributed manner is done using a weighted approach. This paper presents a novel MapReduce-based synthesis approach, which can work well over a distributed storage of large amount of data.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] Anytime Frequent Itemset Mining of Transactional Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    BIG DATA RESEARCH, 2020, 21
  • [22] Recommendation using Frequent Itemset Mining in Big Data
    Kunjachan, Honeytta
    Hareesh, M. J.
    Sreedevi, K. M.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 561 - 566
  • [23] Distributed Data Association Rule Mining: Tools and Techniques
    Sethi, Manoj
    Jindal, Rajni
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 481 - 485
  • [24] Issues in Quantitative Association Rule Mining: A Big Data Perspective
    Adhikary, Dhrubajit
    Roy, Swarup
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT ICT4SD 2015, VOL 2, 2016, 409 : 377 - 385
  • [25] A Distributed Method for Fast Mining Frequent Patterns From Big Data
    Huang, Peng-Yu
    Cheng, Wan-Shu
    Chen, Ju-Chin
    Chung, Wen-Yu
    Chen, Young-Lin
    Lin, Kawuu W.
    IEEE ACCESS, 2021, 9 : 135144 - 135159
  • [26] A Group Mining Method for Big Data on Distributed Vehicle Trajectories in WAN
    Yang, Jie
    Li, Xiaoping
    Wang, Dandan
    Wang, Jia
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2015,
  • [27] Neutrosophic Association Rule Mining Algorithm for Big Data Analysis
    Abdel-Basset, Mohamed
    Mohamed, Mai
    Smarandache, Florentin
    Chang, Victor
    SYMMETRY-BASEL, 2018, 10 (04):
  • [28] Association Rule Mining Algorithm Improvement and Implementation Analysis for Big Data Oriented Education
    Rui, Jiang
    EDUCATION AND MANAGEMENT INNOVATION, 2017, : 268 - 273
  • [29] Fuzzy Models for Big Data Mining
    Ducange, Pietro
    FUZZY LOGIC AND APPLICATIONS, WILF 2018, 2019, 11291 : 257 - 260
  • [30] Data Mining with Big Data
    Sowmya, R.
    Suneetha, K. R.
    PROCEEDINGS OF 2017 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS AND CONTROL (ISCO 2017), 2017, : 246 - 250