A Priori Study on Factors Affecting MapReduce Performance in Cloud-Based Environment

被引:0
作者
Vijay, Vandana [1 ]
Nanda, Ruchi [1 ]
机构
[1] Deemed Be Univ, Dept CS & IT, IIS, Jaipur, Rajasthan, India
来源
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 3 | 2023年 / 464卷
关键词
Big data; MapReduce; Hadoop; HDFS; Cloud computing;
D O I
10.1007/978-981-19-2394-4_46
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the current era, global data have been rising at a very fast speed due to the excessive use of technologies including cloud and IoT. It leads to the development of big data that can handle and analyze a high volume of data regularly. Cloud computing provides a reliable, available, and scalable environment for the processing of this huge data. MapReduce has become an important computing model for processing and generating high-volume datasets on a cluster of machines. It not only allows distributed processing but also significant attributes like flexibility, versatility, load adjusting, and adaptation to internal failure. Despite these benefits, the performance of this framework gets affected by multiple factors, namely indexing, data skew, joining, caching, and load balancing. The main objective of this paper is to identify the factors to resolve its performance issues and investigate alternate strategies to improve the MapReduce query performance in the cloud-based environment.
引用
收藏
页码:509 / 515
页数:7
相关论文
共 13 条
[1]  
Basha S, 2016, IJSRSET, V2, P126
[2]   Only Aggressive Elephants are Fast Elephants [J].
Dittrich, Jens ;
Quiane-Ruiz, Jorge-Arnulfo ;
Richter, Stefan ;
Schuh, Stefan ;
Jindal, Alekh ;
Schad, Joerg .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (11) :1591-1602
[3]   AutoCache: Employing Machine Learning to Automate Caching in Distributed File Systems [J].
Herodotou, Herodotos .
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW 2019), 2019, :133-139
[4]  
Huang LJ, 2012, INT C NUMER SIMUL, P59, DOI 10.1109/NUSOD.2012.6316506
[5]   A novel algorithm for handling reducer side data skew in MapReduce based on a learning automata game [J].
Irandoost, Mohammad Amin ;
Rahmani, Amir Masoud ;
Setayeshi, Saeed .
INFORMATION SCIENCES, 2019, 501 :662-679
[6]   The Performance of MapReduce: An In-depth Study [J].
Jiang, Dawei ;
Ooi, Beng Chin ;
Shi, Lei ;
Wu, Sai .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01) :472-483
[7]   Task failure resilience technique for improving the performance of MapReduce in Hadoop [J].
Kavitha, C. ;
Anita, X. .
ETRI JOURNAL, 2020, 42 (05) :751-763
[8]  
Khafagy M. H., 2015, Int J Sci Eng Res, V6, P705
[9]  
Kumar V, 2017, Int J Comput Sci Mob Comput, V6, P389
[10]   An efficient theta-join query processing in distributed environment [J].
Liu, Wenjie ;
Li, Zhanhuai .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2018, 121 :42-52