MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC) | 2013年
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 34 条
  • [1] Design and Implement a MapReduce Framework for Executing Standalone Software Packages in Hadoop-based Distributed Environmentsn
    Chen, Chao-Chun
    Hung, Min-Hsiung
    Giang, Nguyen Huu Tinh
    Lin, Hsuan-Chun
    Lin, Tzu-Chao
    SMART SCIENCE, 2013, 1 (02) : 99 - 107
  • [2] A Hadoop-Based Visualization and Diagnosis Framework for Earth Science Data
    Zhou, Shujia
    Yang, Xi
    Li, Xiaowen
    Matsui, Toshihisa
    Liu, Si
    Sun, Xian-He
    Tao, Weikuo
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 1972 - 1977
  • [3] Hadoop framework implementation and performance analysis on a cloud
    Ozen, Goksu Zekiye
    Tekerek, Mehmet
    Sultanov, Rayimbek
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (02) : 705 - 716
  • [4] Modern Framework for Distributed Healthcare Data Analytics Based on Hadoop
    Raja, P. Vignesh
    Sivasankar, E.
    INFORMATION AND COMMUNICATION TECHNOLOGY, 2014, 8407 : 348 - 355
  • [5] A HADOOP-BASED DISTRIBUTED FRAMEWORK FOR EFFICIENT MANAGING AND PROCESSING BIG REMOTE SENSING IMAGES
    Wang, C.
    Hu, F.
    Hu, X.
    Zhao, S.
    Wen, W.
    Yang, C.
    ISPRS International Workshop on Spatiotemporal Computing, 2015, : 63 - 66
  • [6] Optimizing the Hadoop MapReduce Framework with high-performance storage devices
    Moon, Sangwhan
    Lee, Jaehwan
    Sun, Xiling
    Kee, Yang-suk
    JOURNAL OF SUPERCOMPUTING, 2015, 71 (09): : 3525 - 3548
  • [7] Optimizing the Hadoop MapReduce Framework with high-performance storage devices
    Sangwhan Moon
    Jaehwan Lee
    Xiling Sun
    Yang-suk Kee
    The Journal of Supercomputing, 2015, 71 : 3525 - 3548
  • [8] A Data Processing Framework for Cloud Environment Based on Hadoop and Grid Middleware
    Kim, Hyukho
    Kim, Woongsup
    Lee, Kyoungmook
    Kim, Yangwoo
    GRID AND DISTRIBUTED COMPUTING, 2011, 261 : 515 - +
  • [9] BIG-BIO: - Big Data Hadoop-based Analytic Cluster Framework for Bioinformatics
    Abul Seoud, Rania Ahmed Abdel Azeem
    Mahmoud, Mahmoud Ahmed
    Ramadan, Amr Essam Eldin
    2017 INTERNATIONAL CONFERENCE ON INFORMATICS, HEALTH & TECHNOLOGY (ICIHT), 2017,
  • [10] An efficient Hadoop-based brain tumor detection framework using big data analytic
    Kaur Chahal, Prabhjot
    Pandey, Shreelekha
    SOFTWARE-PRACTICE & EXPERIENCE, 2022, 52 (03): : 805 - 818