MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC) | 2013年
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 35 条
  • [31] Graph BLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU
    Yang, Carl
    Buluc, Aydin
    Owens, John D.
    ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 2022, 48 (01):
  • [32] Lit: A High Performance Massive Data Computing Framework Based on CPU/GPU Cluster
    Zhai, Yanlong
    Mbarushimana, Emmanuel
    Li, Wei
    Zhang, Jing
    Guo, Ying
    2013 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2013,
  • [33] Cost-Efficient High-Performance Internet-Scale Data Analytics over Multi-Cloud Environments
    Imai, Shigeru
    Patterson, Stacy
    Varela, Carlos A.
    2015 15TH IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND GRID COMPUTING, 2015, : 793 - 796
  • [34] A cloud-based system framework for performing online viewing, storage, and analysis on big data of massive BIMs
    Chen, Hung-Ming
    Chang, Kai-Chuan
    Lin, Tsung-Hsi
    AUTOMATION IN CONSTRUCTION, 2016, 71 : 34 - 48
  • [35] SparkFlow: Towards High-Performance Data Analytics for Spark-based Genome Analysis
    Filgueira, Rosa
    Awaysheh, Feras M.
    Carter, Adam
    White, Darren J.
    Rana, Omer
    2022 22ND IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER, CLOUD AND INTERNET COMPUTING (CCGRID 2022), 2022, : 1007 - 1016