Processing of Medical Different Types of Data Using Hadoop and Java MapReduce

Cited by: 0
Authors
Boyko, Nataliya [1]
Tkachuk, Nazar [1]
Affiliations
[1] Lviv Polytech Natl Univ, Profesorska St 1, UA-79013 Lvov, Ukraine
Source
IDDM 2020: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON INFORMATICS & DATA-DRIVEN MEDICINE | 2020 / Vol. 2753
Keywords
Data processing; Hadoop; Java Map/Reduce; Heterogeneous data processing; MapReduce; Big data; Data Analysis; HDFS; multiple input
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This article analyzes sample data of different types using Java MapReduce on the Hadoop platform. The Java programming language and the Java MapReduce API are used to process large amounts of data ("Big Data") that differ in format and structure. The task was to process the medical data and produce a single output file. The program's result is saved in the HDFS file system. The output data can then be exported to the NTFS file system using Sqoop, or the files can be copied manually for further processing.
Pages: 9