BioHIPI: Biomedical Hadoop Image Processing Interface

被引:0
作者
Calimeri, Francesco [1 ]
Caracciolo, Mirco [1 ]
Marzullo, Aldo [1 ]
Stamile, Claudio [2 ]
机构
[1] Univ Calabria, Dept Math & Comp Sci, Arcavacata Di Rende, Italy
[2] Katholieke Univ Leuven, Dept Elect Engn ESAT, STADIUS, Leuven, Belgium
来源
MACHINE LEARNING, OPTIMIZATION, AND BIG DATA, MOD 2017 | 2018年 / 10710卷
关键词
Big Data; Hadoop; Image processing; BIG DATA;
D O I
10.1007/978-3-319-72926-8_45
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nowadays, the importance of collecting large amounts of data is becoming increasingly crucial, along with the application of efficient and effective analysis techniques, in many areas. One of the most important field in which Big Data is becoming of fundamental importance is the biomedical domain, also due to the decreasing cost of acquiring and analyzing biomedical data. Furthermore, the emergence of more accessible technologies and the increasing speed-up of algorithms, also thanks to parallelization techniques, is helping at making the application of Big Data in healthcare a fast-growing field. This paper presents a novel framework, Biomedical Hadoop Image Processing Interface (BioHIPI), capable of storing biomedical image collections in a Distributed File System (DFS) for exploiting the parallel processing of Big Data on a cluster of machines. The work is based on the Apache Hadoop technology and makes use of the Hadoop Distributed File System (HDFS) for storing images, the MapReduce libraries for parallel programming for processing, and Yet Another Resource Negotiator (YARN) to run processes on the cluster.
引用
收藏
页码:540 / 548
页数:9
相关论文
共 10 条
[1]  
[Anonymous], P 4 ANN S CLOUD COMP, DOI [10.1145/2523616.2523633, DOI 10.1145/2523616.2523633]
[2]  
[Anonymous], 2003, P 19 ACM S OP SYST P, DOI [10.1145/1165389.945450, DOI 10.1145/1165389.945450]
[3]   Mapreduce: Simplified data processing on large clusters [J].
Dean, Jeffrey ;
Ghemawat, Sanjay .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :107-113
[4]  
HENSCHEN D, 2010, INFORM WEEK, V24
[5]   Big Data Application in Biomedical Research and Health Care: A Literature Review [J].
Luo, Jake ;
Wu, Min ;
Gopukumar, Deepika ;
Zhao, Yiqing .
BIOMEDICAL INFORMATICS INSIGHTS, 2016, 8 :1-10
[6]   The National Institutes of Health's Big Data to Knowledge (BD2K) initiative: capitalizing on biomedical big data [J].
Margolis, Ronald ;
Derr, Leslie ;
Dunn, Michelle ;
Huerta, Michael ;
Larkin, Jennie ;
Sheehan, Jerry ;
Guyer, Mark ;
Green, Eric D. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (06) :957-958
[7]   The ImageJ ecosystem: An open platform for biomedical image analysis [J].
Schindelin, Johannes ;
Rueden, Curtis T. ;
Hiner, Mark C. ;
Eliceiri, Kevin W. .
MOLECULAR REPRODUCTION AND DEVELOPMENT, 2015, 82 (7-8) :518-529
[8]  
Sweeney Chris., 2011, HIPI HADOOP IMAGE PR
[9]   An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics [J].
Taylor, Ronald C. .
BMC BIOINFORMATICS, 2010, 11
[10]  
White Tom, 2012, Hadoop: The Definitive Guide