A Comparative Study of Big Data Processing : Hadoop vs. Spark

被引:0
作者
Sharma, Meghna [1 ]
Kaur, Jagdeep [1 ]
机构
[1] NorthCap Univ, Comp Sci & Engn, Gurugram, Haryana, India
来源
PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM) | 2019年
关键词
Big Data; Hadoop; Spark; In-Memory Processing; On-Disk Processing;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Apache Spark and Hadoop's MapReduce are two very important tools used for Big Data processing. The processing started with Hadoop's MapReduce Framework but suffers from many disadvantages due to multiple disc processing operations. The drawbacks of the traditional big data processing have been overcome by in memory handling framework like Spark. In some aspects they go hand in hand as due to lack of file system in Spark, it needs to depend upon MapReduce. This paper has shown the extensive study on various tools related to Big Data processing and has done extensive comparison on MapReduce Vs Spark. The frameworks have been studied on real time datasets and finally compared in terms of processing time. Spark showing the remarkable improvement over MapReduce.
引用
收藏
页码:1073 / 1077
页数:5
相关论文
共 16 条
[11]  
Shaw S., 2016, Practical Hive: A Guide to Hadoop's Data Warehouse System, P37, DOI [10.1007/978-1-4842-0271-5_3, DOI 10.1007/978-1-4842-0271-5_3]
[12]  
Shireesha R., 2016, IJACTA, V4, P152
[13]   Integrated security infrastructures for law enforcement agencies [J].
Stoianov, Nikolai ;
Uruena, Manuel ;
Niemiec, Marcin ;
Machnik, Petr ;
Maestro, Gema .
MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (12) :4453-4468
[14]  
Usha D., 2014, Int. J. Current Eng. Technol., V4, P602
[15]  
Vohra D., 2016, Pro Docker, P151, DOI [10.1007/978-1-4842-1830-3_11, DOI 10.1007/978-1-4842-1830-3_11]
[16]  
White T, 2012, HADOOP DEFINITIVE GU