MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC) | 2013年
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 35 条
  • [21] Software-Defined Networking for Scalable Cloud-based Services to Improve System Performance of Hadoop-based Big Data Applications
    Hagos, Desta Haileselassie
    INTERNATIONAL JOURNAL OF GRID AND HIGH PERFORMANCE COMPUTING, 2016, 8 (02) : 1 - 22
  • [22] Distributed XPath Query Processing over Large XML Data based on MapReduce framework
    Fan, Hongjie
    Wang, Dongsheng
    Liu, Junfei
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1447 - 1453
  • [23] Handling distributed XML queries over large XML data based on MapReduce framework
    Fan, Hongjie
    Ma, Zhiyi
    Wang, Dianhui
    Liu, Junfei
    INFORMATION SCIENCES, 2018, 453 : 1 - 20
  • [24] Performance Analysis of Matrix and Graph Computations using Data Compression Techniques in MPI and Hadoop MapReduce in Big Data Framework
    Ramakrishnaiah, Nagendla
    Reddy, Sirigiri Konda
    2017 IEEE INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES AND MANAGEMENT FOR COMPUTING, COMMUNICATION, CONTROLS, ENERGY AND MATERIALS (ICSTM), 2017, : 54 - 62
  • [25] MaReIA: a cloud MapReduce based high performance whole slide image analysis framework
    Hoang Vo
    Jun Kong
    Dejun Teng
    Yanhui Liang
    Ablimit Aji
    George Teodoro
    Fusheng Wang
    Distributed and Parallel Databases, 2019, 37 : 251 - 272
  • [26] MaReIA: a cloud MapReduce based high performance whole slide image analysis framework
    Vo, Hoang
    Kong, Jun
    Teng, Dejun
    Liang, Yanhui
    Aji, Ablimit
    Teodoro, George
    Wang, Fusheng
    DISTRIBUTED AND PARALLEL DATABASES, 2019, 37 (02) : 251 - 272
  • [27] An Extended IMS Framework With a High-Performance and Scalable Distributed Storage and Computing System
    Seraoui, Youssef
    Raouyane, Brahim
    Bellafkih, Mostafa
    2017 INTERNATIONAL SYMPOSIUM ON NETWORKS, COMPUTERS AND COMMUNICATIONS (ISNCC), 2017,
  • [28] High-performance XML modeling of parallel queries based on MapReduce framework
    Kunfang Song
    Hongwei Lu
    Cluster Computing, 2016, 19 : 1975 - 1986
  • [29] Deep recurrent neural network-based Hadoop framework for COVID prediction with applications to big data in cloud computing
    Rao, D. B. Jagannadha
    Polepally, Vijayakumar
    Prabhu, S. Nagendra
    Kalpana, Parsi
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2023, 21 (01) : 36 - 47
  • [30] High-performance XML modeling of parallel queries based on MapReduce framework
    Song, Kunfang
    Lu, Hongwei
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2016, 19 (04): : 1975 - 1986