New and Existing Approaches Reviewing of Big Data Analysis with Hadoop Tools

Cited by: 5
Authors
Mutasher, Watheq Ghanim [1]
Aljuboori, Abbas Fadhil [2]
Affiliations
[1] Informat Inst Postgrad Studies, Baghdad, Iraq
[2] Univ Informat Technol & Commun, Baghdad, Iraq
Keywords
Apache-Spark; Big Data; Hadoop; IoT; Social Media
DOI
10.21123/bsj.2022.19.4.0887
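Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract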
Everyone is connected to social media platforms such as Facebook, Twitter, LinkedIn, and Instagram, which generate large quantities of data that traditional applications are inadequate to process. Social media are regarded as an important platform for sharing the information, opinions, and knowledge of many subscribers. These basic attributes of social media also tie big data to many issues, such as data collection, storage, movement, updating, reviewing, posting, scanning, visualization, and data protection. To deal with these problems, there is a need for an adequate system that not only prepares the data but also provides meaningful analysis that can be put to use in difficult situations relevant to business, decision-making, health, social media, science, telecommunications, the environment, and other fields. From a reading of previous studies, the authors note that different analyses, such as real-time sentiment analysis, have been carried out with Hadoop and its various tools. However, dealing with big data of this kind is a challenging task, and such analysis can therefore be carried out efficiently only through the Hadoop ecosystem. The purpose of this paper is to review the literature on big data analysis of social media using the Hadoop framework, in order to survey most of the analysis tools available under the Hadoop umbrella and their orientations, along with their difficulties and the modern methods they offer for overcoming the challenges of big data in offline and real-time processing. Real-time analytics accelerates decision-making and provides access to business metrics and reporting. A comparison between Hadoop and Spark is also presented.
Pages: 887-898
Page count: 12