Technology Enablers for Big Data, Multi-Stage Analysis in Medical Image Processing

Cited by: 0
Authors
Bao, Shunxing [1]
Parvathaneni, Prasanna [1]
Huo, Yuankai [1]
Barve, Yogesh [1]
Plassard, Andrew J. [1]
Yao, Yuang [1]
Sun, Hongyang [1]
Lyu, Ilwoo [1]
Zald, David H. [2]
Landman, Bennett A. [1]
Gokhale, Aniruddha [1]
Affiliations
[1] Vanderbilt Univ, Dept Elect Engn & Comp Sci, Nashville, TN 37235 USA
[2] Vanderbilt Univ, Dept Psychiat & Psychol, Nashville, TN 37235 USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2018
Keywords
Hadoop; Medical image processing; Big data multi-stage analysis; Simulator; REGISTRATION ALGORITHMS; BRAIN; MAPREDUCE;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Big data medical image processing applications involving multi-stage analysis often exhibit significant variability in processing times, ranging from a few seconds to several days. Moreover, because traditional software technologies and platforms enforce sequential execution of the analysis stages, errors in the pipeline are detected only at the later stages, even though they originate predominantly in the highly compute-intensive first stage. This wastes precious computing resources and incurs prohibitively high costs for re-executing the application. The medical image processing community to date remains largely unaware of these issues and continues to use traditional high-performance computing clusters, which incur a high operating cost due to the use of dedicated resources and expensive centralized file systems. To overcome these challenges, this paper proposes an alternative approach for multi-stage analysis in medical image processing that uses the Apache Hadoop ecosystem and offers it as a service in the cloud. We make the following contributions. First, we propose a concurrent pipeline execution framework and an associated semi-automatic, real-time monitoring and checkpointing framework that can detect outliers and achieve quality assurance without having to completely execute the expensive first stage of processing, thereby expediting the entire multi-stage analysis. Second, we present a simulator to rapidly estimate the execution time for a given multi-stage analysis, which can help users decide on the appropriate approach for their use cases. We conduct an empirical evaluation of our framework and show that it requires 76.75% less wall time and 29.22% less resource time than the traditional approach, which lacks such a quality assurance mechanism.
Pages: 1337-1346
Page count: 10
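
The abstract's central idea, stopping an outlier at a checkpoint instead of running the expensive first stage to completion, can be illustrated with a toy cost model. This is a minimal sketch and not the authors' simulator or framework: the `Job` class, both functions, the `checkpoint_fraction` parameter, and all numbers are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class Job:
    """One image-processing job (hypothetical model, not from the paper)."""
    stage1_hours: float   # full cost of the compute-intensive first stage
    is_outlier: bool      # True if the job would fail quality assurance


def sequential_resource_hours(jobs, stage2_hours):
    """Traditional approach: every job runs both stages to completion,
    because an error only becomes visible in the later stage."""
    return sum(job.stage1_hours + stage2_hours for job in jobs)


def checkpointed_resource_hours(jobs, stage2_hours, checkpoint_fraction):
    """Monitored approach (assumed behavior): an outlier is detected at a
    checkpoint after `checkpoint_fraction` of stage 1 and is stopped there."""
    total = 0.0
    for job in jobs:
        if job.is_outlier:
            total += job.stage1_hours * checkpoint_fraction
        else:
            total += job.stage1_hours + stage2_hours
    return total


# Illustrative input: one normal job and one outlier, 10 hours of stage 1 each.
jobs = [Job(stage1_hours=10.0, is_outlier=False),
        Job(stage1_hours=10.0, is_outlier=True)]
baseline = sequential_resource_hours(jobs, stage2_hours=1.0)
monitored = checkpointed_resource_hours(jobs, stage2_hours=1.0,
                                        checkpoint_fraction=0.25)
print(baseline, monitored)
```

The model covers resource time only. Estimating wall time would also require assumptions about cluster size and scheduling, which the abstract does not state.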