Optimization and Application in Medical Big Document-Data of Apriori Algorithm based on MapReduce

Authors
Li Wei [1]
Liu Guangming [1]
Shao Yachao [2]
Liu Junlong [2]
Zuo You [2]
Affiliations
[1] Natl Univ Def Technol, Changsha, Hunan, Peoples R China
[2] Natl Supercomp Ctr Tianjin, Tianjin, Peoples R China
Source
2016 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI) | 2016
Keywords
component; Medical big data; NoSQL; MapReduce; Data Mining; Apriori; optimization;
DOI
Not available
Chinese Library Classification
TP39 [Computer applications];
Subject classification codes
081203 ; 0835 ;
Abstract
To address the redundancy, high dimensionality, complexity, and heterogeneity of medical documents, and the resulting difficulty of mining the value hidden in large volumes of medical document data, this paper proposes a system called MSPM based on NoSQL and MapReduce. Using key-value storage, complex and heterogeneous data are consolidated into a unified transaction format convenient for Apriori. Apriori is then executed in parallel through MapReduce. Finally, two strategies, generating all candidate sets non-recursively and restricting counting to candidate sets of interest, address the low speed, high overhead, and poor effectiveness of the Apriori algorithm when applied to medical data. Test results show that the optimized algorithm is effective.
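The pipeline outlined in the abstract can be illustrated with a minimal, single-process Python sketch. It is not the MSPM implementation: the record fields, the function names (`to_transaction`, `map_phase`, `reduce_phase`, `frequent_itemsets`), and the `must_contain` constraint are all assumptions made for illustration, and the map and reduce steps are plain functions rather than a real MapReduce job. It also enumerates candidates directly with `itertools.combinations` instead of performing Apriori's level-wise pruning, as one possible reading of "non-recursive" candidate generation.

```python
from collections import Counter
from itertools import combinations


def to_transaction(record):
    """Flatten one key-value record into a set of 'key=value' items (assumed format)."""
    return frozenset(f"{key}={value}" for key, value in record.items())


def map_phase(transaction, max_size, must_contain=None):
    """Emit (itemset, 1) for each candidate up to max_size, using iteration only.

    If must_contain is given, only candidates that include that item are emitted,
    which stands in for counting only the candidate sets of interest.
    """
    items = sorted(transaction)
    for size in range(1, max_size + 1):
        for combo in combinations(items, size):
            if must_contain is None or must_contain in combo:
                yield combo, 1


def reduce_phase(pairs):
    """Sum the counts emitted for each itemset."""
    counts = Counter()
    for itemset, n in pairs:
        counts[itemset] += n
    return counts


def frequent_itemsets(records, min_support, max_size=2, must_contain=None):
    """Run map then reduce, and keep itemsets whose count reaches min_support."""
    pairs = (
        pair
        for record in records
        for pair in map_phase(to_transaction(record), max_size, must_contain)
    )
    counts = reduce_phase(pairs)
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical toy records; real medical documents would be far more heterogeneous.
records = [
    {"diagnosis": "flu", "drug": "A"},
    {"diagnosis": "flu", "drug": "A"},
    {"diagnosis": "flu", "drug": "B"},
    {"diagnosis": "cold", "drug": "B"},
]
result = frequent_itemsets(
    records, min_support=2, max_size=2, must_contain="diagnosis=flu"
)
print(result)
```

With these toy records, only itemsets that include `diagnosis=flu` are counted, so the reducer handles three candidates instead of five. In a real deployment the mapper output would be shuffled by itemset key across workers before reduction.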
Pages: 5