Optimization and Application in Medical Big Document-Data of Apriori Algorithm based on MapReduce

Citations: 0
Authors
Li Wei [1]
Liu Guangming [1]
Shao Yachao [2]
Liu Junlong [2]
Zuo You [2]
Affiliations
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Natl Supercomp Ctr Tianjin, Tianjin, Peoples R China
Source
2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI) | 2016
Keywords
component; Medical big data; NoSQL; MapReduce; Data Mining; Apriori; optimization;
DOI
Not available
Chinese Library Classification
TP39 [Computer Applications];
Subject classification code
081203 ; 0835 ;
Abstract
Medical documents are redundant, multi-dimensional, complex, and heterogeneous, which makes it difficult to mine the value hidden in large volumes of medical document data. To address these challenges, this paper proposes a system called MSPM, based on NoSQL and MapReduce. By storing data as key-value pairs, complex and heterogeneous data are consolidated into a unified, convenient transaction format for Apriori. Apriori is then executed in parallel through MapReduce. Finally, two strategies, generating all candidate sets non-recursively and restricting counting to the candidate sets of interest, address the low speed, high overhead, and poor effectiveness of the Apriori algorithm when applied to medical data. Test results show that the optimized algorithm is feasible.
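The record does not include the paper's implementation, so the following is only a minimal Python sketch of the ideas named in the abstract, under stated assumptions. It flattens records into sets of "field=value" items, enumerates candidate itemsets in a single non-recursive pass per transaction (a simplification, not level-wise Apriori pruning), counts only candidates that contain an item of interest, and sums the counts in a reduce step. The sample records, the function names, and the `must_contain` and `max_size` parameters are all hypothetical and are not taken from MSPM.

```python
from collections import Counter
from itertools import combinations

# Hypothetical transactions: each medical record flattened into "field=value" items.
TRANSACTIONS = [
    {"dx=diabetes", "drug=metformin", "sex=F"},
    {"dx=diabetes", "drug=metformin", "sex=M"},
    {"dx=diabetes", "drug=insulin", "sex=F"},
    {"dx=asthma", "drug=salbutamol", "sex=M"},
]


def map_phase(transaction, max_size, must_contain):
    """Emit (candidate, 1) for each itemset of size 1..max_size in one
    transaction, keeping only candidates that include an item of interest."""
    items = sorted(transaction)  # canonical order so equal itemsets share one key
    for k in range(1, max_size + 1):
        for candidate in combinations(items, k):
            if must_contain & set(candidate):
                yield candidate, 1


def reduce_phase(pairs):
    """Sum the emitted counts per candidate itemset."""
    counts = Counter()
    for candidate, value in pairs:
        counts[candidate] += value
    return counts


def frequent_itemsets(transactions, min_support, max_size, must_contain):
    """Run the map and reduce steps in-process and filter by minimum support."""
    pairs = (
        pair
        for transaction in transactions
        for pair in map_phase(transaction, max_size, must_contain)
    )
    counts = reduce_phase(pairs)
    return {c: n for c, n in counts.items() if n >= min_support}


RESULT = frequent_itemsets(
    TRANSACTIONS, min_support=2, max_size=2, must_contain={"dx=diabetes"}
)
print(RESULT)
```

In a real MapReduce job the map and reduce functions would run on separate workers and the framework would group pairs by key; here both steps run in one process purely for illustration.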
Pages: 5
Related papers
50 in total
  • [41] Atrak: a MapReduce-based data warehouse for big data
    Mohammadhossein Barkhordari
    Mahdi Niamanesh
    The Journal of Supercomputing, 2017, 73 : 4596 - 4610
  • [42] Modulo Based Data Placement Algorithm for Energy Consumption Optimization of MapReduce System
    Song, Jie
    He, HongYan
    Wang, Zhi
    Yu, Ge
    Pierson, Jean-Marc
    JOURNAL OF GRID COMPUTING, 2018, 16 (03) : 409 - 424
  • [43] Research on Application of Big Data in Medical Industry
    Li, Guijie
    Liu, Yutao
    Cai, Hengyu
    2018 3RD INTERNATIONAL CONFERENCE ON SMART CITY AND SYSTEMS ENGINEERING (ICSCSE), 2018, : 763 - 765
  • [45] Distributed Whale Optimization Algorithm based on MapReduce
    Khalil, Yasser
    Alshayeji, Mohammad
    Ahmad, Imtiaz
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (01)
  • [46] Analysis of Effectiveness of Apriori Algorithm in Medical Billing Data Mining
    Abdullah, Umair
    Ahmad, Jamil
    Ahmed, Aftab
    2008 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2008, : 329 - +
  • [48] A visualization algorithm for medical big data based on deep learning
    Qiu, Yongjian
    Lu, Jing
    MEASUREMENT, 2021, 183
  • [49] Big data clustering with varied density based on MapReduce
    Heidari, Safanaz
    Alborzi, Mahmood
    Radfar, Reza
    Afsharkazemi, Mohammad Ali
    Ghatari, Ali Rajabzadeh
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [50] A MapReduce-Based ELM for Regression in Big Data
    Wu, B.
    Yan, T. H.
    Xu, X. S.
    He, B.
    Li, W. H.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 164 - 173