Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19

被引:15
作者
Azeroual, Otmane [1 ]
Fabre, Renaud [2 ]
机构
[1] German Ctr Higher Educ Res & Sci Studies DZHW, D-10117 Berlin, Germany
[2] Univ Paris 08, Dionysian Econ Lab LED, F-93200 St Denis, France
关键词
big data; data processing; unstructured data; large amounts of data; COVID-19; challenges; Hadoop technology; MapReduce; WordCount; ANALYTICS;
D O I
10.3390/bdcc5010012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting. As experienced in former massive information issues, big data technologies, such as Hadoop, should efficiently tackle the incoming large amounts of data and provide organizations with relevant processed information that was formerly neither visible nor manageable. After having briefly recalled the strategic advantages of big data solutions in the introductory remarks, in the first part of this paper, we focus on the advantages of big data solutions in the currently difficult time of the COVID-19 pandemic. We characterize it as an endemic heterogeneous data context; we then outline the advantages of technologies such as Hadoop and its IT suitability in this context. In the second part, we identify two specific advantages of Hadoop solutions, globality combined with flexibility, and we notice that they are at work with a "Hadoop Fusion Approach" that we describe as an optimal response to the context. In the third part, we justify selected qualifications of globality and flexibility by the fact that Hadoop solutions enable comparable returns in opposite contexts of models of partial submodels and of models of final exact systems. In part four, we remark that in both these opposite contexts, Hadoop's solutions allow a large range of needs to be fulfilled, which fits with requirements previously identified as the current heterogeneous data structure of COVID-19 information. In the final part, we propose a framework of strategic data processing conditions. To the best of our knowledge, they appear to be the most suitable to overcome COVID-19 massive information challenges.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Applications of Artificial Intelligence and Big Data for Covid-19 Pandemic: A Review
    Liao, Maoxin
    Zhu, Tingting
    2022 9TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2022, 2022, : 253 - 259
  • [42] Hadoop Paradigm for Satellite Environmental Big Data Processing
    Semlali, Badr-Eddine Boudriki
    El Amrani, Chaker
    Ortiz, Guadalupe
    INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 23 - 47
  • [43] Understanding changes in park visitation during the COVID-19 pandemic: A spatial application of big data
    Rice, William L.
    Pan, Bing
    WELLBEING SPACE AND SOCIETY, 2021, 2
  • [44] Big data in genomic research for big questions with examples from covid-19 and other zoonoses
    Wassenaar, Trudy M.
    Ussery, David W.
    Rosel, Adriana Cabal
    JOURNAL OF APPLIED MICROBIOLOGY, 2023, 134 (01)
  • [45] Analyzing COVID-19 Epidemiological Data
    Zhao, Chenru
    Leung, Carson K.
    Pazdor, Adam G. M.
    Wen, Qi
    2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 985 - 990
  • [46] PERFORMANCE COMPARISON OF APACHE SPARK AND HADOOP FOR MACHINE LEARNING BASED ITERATIVE GBTR ON HIGGS AND COVID-19 DATASETS
    Sewal, Piyush
    Singh, Hari
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1373 - 1386
  • [47] PERFORMANCE COMPARISON OF APACHE SPARK AND HADOOP FOR MACHINE LEARNING BASED ITERATIVE GBTR ON HIGGS AND COVID-19 DATASETS
    Sewal P.
    Singh H.
    Scalable Computing, 2024, 25 (03): : 1373 - 1386
  • [48] Architecture of Efficient Word Processing using Hadoop MapReduce for Big Data Applications
    Mandal, Bichitra
    Sahoo, Ramesh Kumar
    Sethi, Srinivas
    PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON MAN AND MACHINE INTERFACING (MAMI), 2015,
  • [49] Outlining Big Data Analytics in Health Sector with Special Reference to Covid-19
    R. Biswas
    Wireless Personal Communications, 2022, 124 : 2097 - 2108
  • [50] The application framework of big data technology during the COVID-19 pandemic in China
    Chen, Wenyu
    Yao, Ming
    Dong, Liang
    Shao, Pingyang
    Zhang, Ye
    Fu, Binjie
    EPIDEMIOLOGY & INFECTION, 2022, 150