MC Framework: High-performance Distributed Framework for Standalone Data Analysis Packages over Hadoop-based Cloud

被引:0
|
作者
Chen, Chao-Chun [1 ]
Giang, Nguyen Huu Tinh [1 ]
Lin, Tzu-Chao [1 ]
Hung, Min-Hsiung [2 ]
机构
[1] Natl Cheng Kung Univ, Inst Mfg Info & Sys, Dept Comp Sci & Info Engr, Tainan 70101, Taiwan
[2] Chinese Culture Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
来源
2013 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING (GRC) | 2013年
关键词
MapReduce; Hadoop; cloud adaptor; multi-users scheduling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Hadoop MapReduce is the programming model of designing the scalable distributed computing applications, that provides developers can attain automatic parallelization. However, most complex manufacturing systems are arduous and restrictive to migrate to private clouds, due to the platform incompatible and tremendous complexity of system reconstruction. For increasing the efficiency of manufacturing systems with minimum efforts on modifying source codes, a high-performance framework is designed in this paper, called Multi-users-based Cloud-Adaptor Framework (MC-Framework), which provides the simple interface to users for fairly executing requested tasks worked with traditional standalone data analysis packages in MapReduce-based private cloud environments. Moreover, this framework focuses on multiuser workloads, but the default Hadoop scheduling scheme, i.e., FIFO, would increase delay under multiuser scenarios. Hence, a new scheduling mechanism, called Job-Sharing Scheduling, is designed to explore and fairly share the jobs to machines in the private cloud. Then, we prototype an experimental virtual-metrology module of a manufacturing system as a case study to verify and analysis the proposed MC-Framework. The results of our experiments indicate that our proposed framework enormously improved the time performance compared with the original package.
引用
收藏
页码:27 / 32
页数:6
相关论文
共 35 条
  • [11] SSFile: A novel column-store for efficient data analysis in Hadoop-based distributed systems
    Son, Jihoon
    Ryu, Hyoseok
    Yi, Sungmin
    Chung, Yon Dohn
    INFORMATION SCIENCES, 2015, 316 : 68 - 86
  • [12] A COMPARATIVE ANALYSIS OF CONVENTIONAL HADOOP WITH PROPOSED CLOUD ENABLED HADOOP FRAMEWORK FOR SPATIAL BIG DATA PROCESSING
    Tripathi, A. K.
    Agrawal, S.
    Gupta, R. D.
    ISPRS TC V MID-TERM SYMPOSIUM GEOSPATIAL TECHNOLOGY - PIXEL TO PEOPLE, 2018, 4-5 : 425 - 430
  • [13] A Hadoop based Framework to Process Geo-distributed Big Data
    Cavallo, Marco
    Cusma', Lorenzo
    Di Modica, Giuseppe
    Polito, Carmelo
    Tomarchio, Orazio
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND SERVICES SCIENCE, VOL 1 (CLOSER), 2016, : 178 - 185
  • [14] Research and Practice of Big Data Analysis Process Based on Hadoop Framework
    Jiang, Hui
    PROCEEDINGS OF 2019 IEEE 3RD INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC 2019), 2019, : 2044 - 2047
  • [15] A security framework in G-Hadoop for big data computing across distributed Cloud data centres
    Zhao, Jiaqi
    Wang, Lizhe
    Tao, Jie
    Chen, Jinjun
    Sun, Weiye
    Ranjan, Rajiv
    Kolodziej, Joanna
    Streit, Achim
    Georgakopoulos, Dimitrios
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2014, 80 (05) : 994 - 1007
  • [16] A Performance Analysis of MapReduce Applications on Big Data in Cloud based Hadoop
    Gohil, Parth
    Garg, Dweepna
    Panchal, Bakul
    2014 INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND EMBEDDED SYSTEMS (ICICES), 2014,
  • [17] A Hadoop-Based Framework for Large-Scale Landmine Detection Using Ubiquitous Big Satellite Imaging Data
    El-Kazzaz, Sahar
    El-Mahdy, Ahmed
    23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, : 274 - 278
  • [18] A distributed data mining system framework for mobile internet access log based on hadoop
    Jiang, Yunliang
    Yang, Jiangang
    Tang, Liang
    Liu, Yong
    Zhao, Xiaoming
    Hao, Xiulan
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2015, 8971 : 243 - 252
  • [19] High-Performance Multiclass Classification Framework Using Cloud Computing Architecture
    Lin, Feng-Sheng
    Shen, Chia-Ping
    Liu, Chia-Hung
    Lin, Han
    Huang, Chi-Ying F.
    Kao, Cheng-Yan
    Lai, Feipei
    Lin, Jeng-Wei
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2015, 35 (06) : 795 - 802
  • [20] High-Performance Multiclass Classification Framework Using Cloud Computing Architecture
    Feng-Sheng Lin
    Chia-Ping Shen
    Chia-Hung Liu
    Han Lin
    Chi-Ying F. Huang
    Cheng-Yan Kao
    Feipei Lai
    Jeng-Wei Lin
    Journal of Medical and Biological Engineering, 2015, 35 : 795 - 802