Technology Enablers for Big Data, Multi-Stage Analysis in Medical Image Processing

Cited by: 0
Authors
Bao, Shunxing [1]
Parvarthaneni, Prasanna [1]
Huo, Yuankai [1]
Barve, Yogesh [1]
Plassard, Andrew J. [1]
Yao, Yuang [1]
Sun, Hongyang [1]
Lyu, Ilwoo [1]
Zald, David H. [2]
Landman, Bennett A. [1]
Gokhale, Aniruddha [1]
Affiliations
[1] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Dept Psychiat & Psychol, Nashville, TN 37235 USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2018
Keywords
Hadoop; Medical image processing; Big data multi-stage analysis; Simulator; REGISTRATION ALGORITHMS; BRAIN; MAPREDUCE;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Big data medical image processing applications involving multi-stage analysis often exhibit significant variability in processing times, ranging from a few seconds to several days. Moreover, because traditional software technologies and platforms enforce sequential execution of the analysis stages, errors in the pipeline are detected only at the later stages, even though they originate predominantly in the highly compute-intensive first stage. This wastes precious computing resources and makes re-executing the application prohibitively expensive. The medical image processing community to date remains largely unaware of these issues and continues to use traditional high-performance computing clusters, which incur a high operating cost due to their dedicated resources and expensive centralized file systems. To overcome these challenges, this paper proposes an alternative approach to multi-stage analysis in medical image processing that uses the Apache Hadoop ecosystem and offers it as a service in the cloud. We make the following contributions. First, we propose a concurrent pipeline execution framework and an associated semi-automatic, real-time monitoring and checkpointing framework that can detect outliers and achieve quality assurance without having to completely execute the expensive first stage of processing, thereby expediting the entire multi-stage analysis. Second, we present a simulator that rapidly estimates the execution time of a given multi-stage analysis, which can help users decide on the appropriate approach for their use cases. We conduct an empirical evaluation of our framework and show that it requires 76.75% less wall time and 29.22% less resource time than the traditional approach, which lacks such a quality assurance mechanism.
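The resource-time argument in the abstract can be illustrated with a toy calculation. The sketch below is not the paper's simulator or framework; it is a minimal model under stated assumptions: every name (`Job`, `traditional_resource_hours`, `monitored_resource_hours`, `saving_percent`) and every number is hypothetical, a faulty job is assumed to be caught at its first checkpoint, and a rerun is assumed to cost a fixed number of hours.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Job:
    stage1_hours: float  # cost of the compute-intensive first stage (hypothetical)
    is_faulty: bool      # True if a later stage would reject this job's output


def traditional_resource_hours(jobs: List[Job], rerun_hours: float) -> float:
    """Sequential model: every job finishes stage 1, faults surface later and force a rerun."""
    total = 0.0
    for job in jobs:
        total += job.stage1_hours
        if job.is_faulty:
            total += rerun_hours
    return total


def monitored_resource_hours(jobs: List[Job], rerun_hours: float,
                             checkpoint_fraction: float) -> float:
    """Monitored model: a faulty job is stopped at the first checkpoint, then rerun."""
    total = 0.0
    for job in jobs:
        if job.is_faulty:
            # Only the work done up to the checkpoint is wasted.
            total += job.stage1_hours * checkpoint_fraction + rerun_hours
        else:
            total += job.stage1_hours
    return total


def saving_percent(baseline: float, improved: float) -> float:
    """Relative reduction of `improved` with respect to `baseline`, in percent."""
    return 100.0 * (baseline - improved) / baseline


# Illustrative inputs only; these are not measurements from the paper.
jobs = [Job(10.0, False), Job(10.0, True), Job(10.0, False), Job(10.0, True)]
baseline = traditional_resource_hours(jobs, rerun_hours=10.0)
improved = monitored_resource_hours(jobs, rerun_hours=10.0, checkpoint_fraction=0.1)
print(baseline, improved, saving_percent(baseline, improved))
```

In this model the saving grows with the share of faulty jobs and shrinks as the first checkpoint moves later in stage 1, which is the intuition behind detecting outliers before the first stage completes.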
Pages: 1337 - 1346
Page count: 10
Related papers
50 in total
  • [21] Rhipe Platform for Big Data Processing and Analysis
    Jung, Byung Ho
    Shin, Ji Eun
    Lim, Dong Hoon
    KOREAN JOURNAL OF APPLIED STATISTICS, 2014, 27 (07) : 1171 - 1185
  • [22] Deep Learning and Big Data Technologies in Medical Image Analysis
    Rastogi, Priyanka
    Singh, Vijendra
    Yadav, Monika
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 60 - 63
  • [23] Performance Modeling and Analysis of a Hadoop Cluster for Efficient Big Data Processing
    Lim, JongBeom
    Ahnh, Jong-Suk
    Lee, Kang-Woo
    ADVANCED SCIENCE LETTERS, 2016, 22 (09) : 2314 - 2319
  • [24] CDR Analysis using Big Data Technology
    Elagib, Sara B.
    Hashim, Aisha-Hassan A.
    Olanrewaju, R. F.
    2015 INTERNATIONAL CONFERENCE ON COMPUTING, CONTROL, NETWORKING, ELECTRONICS AND EMBEDDED SYSTEMS ENGINEERING (ICCNEEE), 2015, : 467 - 471
  • [25] Parallel, multi-stage processing of colors, faces and shapes in macaque inferior temporal cortex
    Lafer-Sousa, Rosa
    Conway, Bevil R.
    NATURE NEUROSCIENCE, 2013, 16 (12) : 1870 - 1878
  • [26] Big Data Processing System for Analysis of GitHub Events
    Voinov, Nikita
    Garzon, Katterine Rodriguez
    Nikiforov, Igor
    Drobintsev, Pavel
    PROCEEDINGS OF 2019 XXII INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2019, : 187 - 190
  • [27] Research based on big data analysis of medical industry
    Chen, Lijun
    Lin, Jiaying
    Yi, Zhang
    2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRONIC MATERIALS, COMPUTERS AND MATERIALS ENGINEERING (AEMCME 2019), 2019, 563
  • [28] Performance Evaluation of Big Data Technology on Designing Big Network Traffic Data Analysis System
    Khamphakdee, Nattawat
    Benjamas, Nunnapus
    Saiyod, Saiyan
    2016 JOINT 8TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND INTELLIGENT SYSTEMS (SCIS) AND 17TH INTERNATIONAL SYMPOSIUM ON ADVANCED INTELLIGENT SYSTEMS (ISIS), 2016, : 454 - 459
  • [29] XHAMI - extended HDFS and MapReduce interface for Big Data image processing applications in cloud computing environments
    Kune, Raghavendra
    Konugurthi, Pramod Kumar
    Agarwal, Arun
    Chillarige, Raghavendra Rao
    Buyya, Rajkumar
    SOFTWARE-PRACTICE & EXPERIENCE, 2017, 47 (03) : 455 - 472
  • [30] Network Security Analysis Using Big Data Technology
    Bachupally, Yogeshwar Rao
    Yuan, Xiaohong
    Roy, Kaushik
    SOUTHEASTCON 2016, 2016,