Developing a data pipeline solution for big data processing

被引:2
作者
Lipovac, Ivona [1 ]
Babac, Marina Bagic [1 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Unska 3, HR-10000 Zagreb, Croatia
关键词
big data; data pipeline; data processing; data analysis; cloud computing;
D O I
10.1504/IJDMMM.2024.136221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a comprehensive exploration of the concept of big data and its management while highlighting the challenges that arise in the process. The study showcases the development of a data pipeline, designed to facilitate big data collection, integration, and analysis while addressing state-of-the-art challenges, methods, tools, and technologies. Emphasis is placed on pipeline flexibility, with a view towards enabling ease of implementation of architecture changes, seamless integration of new sources, and straightforward implementation of additional transformations in existing pipelines as needed. The pipeline architecture is discussed in detail, with a focus on its design principles, components, and implementation details, as well as the mechanisms used to ensure its reliability, scalability, and performance. Results from a range of experiments demonstrate the pipeline's effectiveness in addressing the challenges of big data management and analysis, as well as its robustness and versatility in accommodating diverse data sources and processing requirements. This study provides insights into the critical role of data pipelines in enabling effective big data management and showcases the importance of flexibility in pipeline design to ensure adaptability to evolving data processing needs.
引用
收藏
页码:1 / 22
页数:23
相关论文
共 50 条
  • [41] Developing Concept Enriched Models for Big Data Processing Within the Medical Domain
    Gudivada, Akhil
    Philips, James
    Tabrizi, Nasseh
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2020, 12 (03): : 55 - 71
  • [42] Analysis and processing aspects of data in big data applications
    Rahul, Kumar
    Banyal, Rohitash Kumar
    Goswami, Puneet
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2020, 23 (02) : 385 - 393
  • [43] Federated Query processing for Big Data in Data Science
    Muniswamaiah, Manoj
    Agerwala, Tilak
    Tappert, Charles C.
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 6145 - 6147
  • [44] A Big Data Processing Platform for Medical Records in Cloud
    Yang, Chao-Tung
    Liu, Jung-Chun
    Lu, Hsin-Wen
    Yan, Yin-Zhen
    Chu, Cheng-Chung
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1406 - 1415
  • [45] Big Data Processing and Access Controls in cloud Environment
    Reddy, Yenumula B.
    2018 IEEE 4TH INTERNATIONAL CONFERENCE ON BIG DATA SECURITY ON CLOUD (BIGDATASECURITY), 4THIEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE AND SMART COMPUTING, (HPSC) AND 3RD IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY (IDS), 2018, : 25 - 33
  • [46] Big Data Processing for Pervasive Environment in Cloud Computing
    Amato, Alba
    Di Martino, Beniamino
    Venticinque, Salvatore
    2014 INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS), 2014, : 598 - 603
  • [47] Comparative analysis of virtualization methods in Big Data processing
    Radchenko G.I.
    Alaasam A.B.A.
    Tchernykh A.N.
    Supercomputing Frontiers and Innovations, 2019, 6 (01) : 48 - 79
  • [48] OpenStack Platform and its Application in Big Data Processing
    Shao, Cen
    Liang, Bo
    Wang, Feng
    Deng, Hui
    Dai, Wei
    Wei, Shoulin
    Zhang, Xiaoli
    Yuan, Zhi
    2015 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT NETWORKS AND INTELLIGENT SYSTEMS (ICINIS), 2015, : 98 - 101
  • [49] Big Data processing technology research and application prospects
    Mu, Li
    Lei, Zhu
    2014 FOURTH INTERNATIONAL CONFERENCE ON INSTRUMENTATION AND MEASUREMENT, COMPUTER, COMMUNICATION AND CONTROL (IMCCC), 2014, : 269 - 273
  • [50] PROCESSING OF BIG DATA IN INTERNET OF THINGS AND PRECISION AGRICULTURE
    Stoces, Michal
    Masner, Jan
    Kanska, Eva
    Jarolimek, Jan
    AGRARIAN PERSPECTIVES XXVII - FOOD SAFETY - FOOD SECURITY, 2018, : 353 - 358