AMPO: Algorithm for MapReduce Performance Optimization for Enhancing Big Data Analytics

被引:0
作者
Yambem, Nandita [1 ]
Nandakumar, A. N. [2 ]
机构
[1] Vemana IT, ISE Dept, VTU RRC, Bangalore, Karnataka, India
[2] GSSSIETW, Dept CSE, Mysuru, Karnataka, India
来源
2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT) | 2017年
关键词
Hadoop; Map Reduce; Optimization; Big Data Analytics; Cloud;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The usage of cloud computing has lead to generation of petabytes of data just in a matter of second, which required a pivotal attention for analysis along with the storage. Although, storage issues in cloud has been solved to a large extent, but performing distributed analytical operation over the cloud is still a bigger challenge. The frequently used Hadoop MapReduce can perform distributed process modeling and inspite of its advantages, its pitfalls overshadow its potential advantageous features in terms of optimization. Hence, this paper presents a technique called as Algorithm for MapReduce Performance Optimization or AMPO for enhancing the performance of big data analytics. An analytical research methodology was adopted considering a case study of larger size traffic data to find that AMPO offers faster response time and lowered cost of resources as compared to the conventional MapReduce Programs without eliminating its major mapping and reducer operations.
引用
收藏
页码:717 / 723
页数:7
相关论文
共 27 条
[1]  
[Anonymous], 2013, The datacenter as a computer an introduction to the design of warehouse-scale machines
[2]  
C-Castello F. J., 2015, IEEE ACM 8 INT C UT
[3]  
Chen L.M., 2015, Mathematical Problems in Data Science
[4]   On the Performance of Byzantine Fault-Tolerant MapReduce [J].
Costa, Pedro ;
Pasin, Marcelo ;
Bessani, Alysson Neves ;
Correia, Miguel P. .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2013, 10 (05) :301-313
[5]  
Dang D., 2016, IEEE T PARALLEL DIST, V27
[6]  
Du W., 2015, IEEE 12 INT C FUZZ K
[7]  
Harrison Guy., 2015, Next Generation Databases: NoSQLand Big Data
[8]   On Traffic-Aware Partition and Aggregation in MapReduce for Big Data Applications [J].
Ke, Huan ;
Li, Peng ;
Guo, Song ;
Guo, Minyi .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (03) :818-828
[9]   Hybrid Job-Driven Scheduling for Virtual MapReduce Clusters [J].
Lee, Ming-Chang ;
Lin, Jia-Chun ;
Yahyapour, Ramin .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (06) :1687-1699
[10]  
Loshin D., 2013, Big data analytics: from strategic planning to enterprise integration with tools, techniques, NoSQL, and graph, DOI 10.1016/B978-0-12-417319-4.00009-0