Optimization and Application in Medical Big Document-Data of Apriori Algorithm based on MapReduce

被引:0
|
作者
Li Wei [1 ]
Liu Guangming [1 ]
Shao Yachao [2 ]
Liu Junlong [2 ]
Zuo You [2 ]
机构
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Natl Supercomp Ctr Tianjin, Tianjin, Peoples R China
来源
2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI) | 2016年
关键词
component; Medical big data; NoSQL; MapReduce; Data Mining; Apriori; optimization;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
For the challenges of redundancy, multi-dimension, complex and heterogeneous in medical documents, and to solve the problem that the value hidden in the huge amounts of medical document-data can't be mined, this paper proposed a system called MSPM based on NOSQL and MapReduce. Through storage of key-value pairs, complex and heterogeneous datas are summed up in a unified and convenient format of transaction for Apriori. Then Apriori is executed in parallel through MapReduce. At last, with the strategies of generating all the candidate sets non-recursively and constraint count for candidate sets of interest, it can solve the problem of low speed, high overhead and poor effectiveness for Apriori algorithm in the application of medical data. Testing results has shown the algorithm of optimization is available.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Performance optimization of MapReduce-based Apriori algorithm on Hadoop cluster
    Singh, Sudhakar
    Garg, Rakhi
    Mishra, P. K.
    COMPUTERS & ELECTRICAL ENGINEERING, 2018, 67 : 348 - 364
  • [2] Apriori Algorithm Optimization Study Based on MapReduce
    Li Chunqing
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTOMATION, MECHANICAL CONTROL AND COMPUTATIONAL ENGINEERING, 2015, 124 : 1466 - 1470
  • [3] AN ALGORITHM OF APRIORI BASED ON MEDICAL BIG DATA AND CLOUD COMPUTING
    Cui, Xiaoyan
    Yang, Shimeng
    Wang, Daming
    PROCEEDINGS OF 2016 4TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (IEEE CCIS 2016), 2016, : 361 - 365
  • [4] Complex Statistical Analysis of Big Data: Implementation and Application of Apriori and FP-Growth Algorithm Based on MapReduce
    Rong, Zbuobo
    Xia, Dawen
    Zhang, Zili
    PROCEEDINGS OF 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS), 2012, : 968 - 972
  • [5] Research on Improved Apriori Algorithm based on MapReduce and HBase
    Feng, Dongyu
    Zhu, Ligu
    Zhang, Lei
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 887 - 891
  • [6] Apriori Versions Based on MapReduce for Mining Frequent Patterns on Big Data
    Maria Luna, Jose
    Padillo, Francisco
    Pechenizkiy, Mykola
    Ventura, Sebastian
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (10) : 2851 - 2865
  • [7] AMPO: Algorithm for MapReduce Performance Optimization for Enhancing Big Data Analytics
    Yambem, Nandita
    Nandakumar, A. N.
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 717 - 723
  • [8] An Improved Parallel Association Rules Algorithm Based on MapReduce Framework for Big Data
    Zhou, Xinhao
    Huang, Yongfeng
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014, : 284 - 288
  • [9] Research on Apriori Algorithm Optimization of Cloud Computing and Big Data in Software Engineering
    Rui, Wang
    2018 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2018), 2018, : 53 - 56
  • [10] Medical diagnosis data mining based on improved Apriori algorithm
    Ma, D., 1600, Academy Publisher (09): : 1339 - 1345