Processing Big Data with Apache Hadoop in the Current Challenging Era of COVID-19

被引：15

作者：

Azeroual, Otmane ^{[1
]}

Fabre, Renaud ^{[2
]}

机构：

[1] German Ctr Higher Educ Res & Sci Studies DZHW, D-10117 Berlin, Germany

[2] Univ Paris 08, Dionysian Econ Lab LED, F-93200 St Denis, France

来源：

BIG DATA AND COGNITIVE COMPUTING | 2021年 / 5卷 / 01期

关键词：

big data; data processing; unstructured data; large amounts of data; COVID-19; challenges; Hadoop technology; MapReduce; WordCount; ANALYTICS;

D O I：

10.3390/bdcc5010012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Big data have become a global strategic issue, as increasingly large amounts of unstructured data challenge the IT infrastructure of global organizations and threaten their capacity for strategic forecasting. As experienced in former massive information issues, big data technologies, such as Hadoop, should efficiently tackle the incoming large amounts of data and provide organizations with relevant processed information that was formerly neither visible nor manageable. After having briefly recalled the strategic advantages of big data solutions in the introductory remarks, in the first part of this paper, we focus on the advantages of big data solutions in the currently difficult time of the COVID-19 pandemic. We characterize it as an endemic heterogeneous data context; we then outline the advantages of technologies such as Hadoop and its IT suitability in this context. In the second part, we identify two specific advantages of Hadoop solutions, globality combined with flexibility, and we notice that they are at work with a "Hadoop Fusion Approach" that we describe as an optimal response to the context. In the third part, we justify selected qualifications of globality and flexibility by the fact that Hadoop solutions enable comparable returns in opposite contexts of models of partial submodels and of models of final exact systems. In part four, we remark that in both these opposite contexts, Hadoop's solutions allow a large range of needs to be fulfilled, which fits with requirements previously identified as the current heterogeneous data structure of COVID-19 information. In the final part, we propose a framework of strategic data processing conditions. To the best of our knowledge, they appear to be the most suitable to overcome COVID-19 massive information challenges.

引用

页数：18

共 50 条

[41] Applications of Artificial Intelligence and Big Data for Covid-19 Pandemic: A Review
Liao, Maoxin
Zhu, Tingting
2022 9TH INTERNATIONAL CONFERENCE ON BIOMEDICAL AND BIOINFORMATICS ENGINEERING, ICBBE 2022, 2022, : 253 - 259
[42] Hadoop Paradigm for Satellite Environmental Big Data Processing
Semlali, Badr-Eddine Boudriki
El Amrani, Chaker
Ortiz, Guadalupe
INTERNATIONAL JOURNAL OF AGRICULTURAL AND ENVIRONMENTAL INFORMATION SYSTEMS, 2020, 11 (01) : 23 - 47
[43] Understanding changes in park visitation during the COVID-19 pandemic: A spatial application of big data
Rice, William L.
Pan, Bing
WELLBEING SPACE AND SOCIETY, 2021, 2
[44] Big data in genomic research for big questions with examples from covid-19 and other zoonoses
Wassenaar, Trudy M.
Ussery, David W.
Rosel, Adriana Cabal
JOURNAL OF APPLIED MICROBIOLOGY, 2023, 134 (01)
[45] Analyzing COVID-19 Epidemiological Data
Zhao, Chenru
Leung, Carson K.
Pazdor, Adam G. M.
Wen, Qi
2021 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS DASC/PICOM/CBDCOM/CYBERSCITECH 2021, 2021, : 985 - 990
[46] PERFORMANCE COMPARISON OF APACHE SPARK AND HADOOP FOR MACHINE LEARNING BASED ITERATIVE GBTR ON HIGGS AND COVID-19 DATASETS
Sewal, Piyush
Singh, Hari
SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (03): : 1373 - 1386
[47] PERFORMANCE COMPARISON OF APACHE SPARK AND HADOOP FOR MACHINE LEARNING BASED ITERATIVE GBTR ON HIGGS AND COVID-19 DATASETS
Sewal P.
Singh H.
Scalable Computing, 2024, 25 (03): : 1373 - 1386
[48] Architecture of Efficient Word Processing using Hadoop MapReduce for Big Data Applications
Mandal, Bichitra
Sahoo, Ramesh Kumar
Sethi, Srinivas
PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON MAN AND MACHINE INTERFACING (MAMI), 2015,
[49] Outlining Big Data Analytics in Health Sector with Special Reference to Covid-19
R. Biswas
Wireless Personal Communications, 2022, 124 : 2097 - 2108
[50] The application framework of big data technology during the COVID-19 pandemic in China
Chen, Wenyu
Yao, Ming
Dong, Liang
Shao, Pingyang
Zhang, Ye
Fu, Binjie
EPIDEMIOLOGY & INFECTION, 2022, 150

← 1 2 3 4 5 →