Building Advanced Web Applications Using Data Ingestion and Data Processing Tools

被引:1
作者
Sprem, Simun [1 ]
Tomazin, Nikola [1 ]
Matecic, Jelena [1 ]
Horvat, Marko [2 ]
机构
[1] Agrokor Energija Doo, Trg Drazena Petrovica 3, HR-10000 Zagreb, Croatia
[2] Univ Zagreb, Fac Elect Engn & Comp, Dept Appl Comp, Unska 3, HR-10000 Zagreb, Croatia
关键词
data engineering; big data analytics; big data management; data acquisition; data ingestion; change data capture (CDC); data analysis tools; real-time data stream processing;
D O I
10.3390/electronics13040709
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Today, advanced websites serve as robust data repositories that constantly collect various user-centered information and prepare it for subsequent processing. The data collected can include a wide range of important information from email addresses, usernames, and passwords to demographic information such as age, gender, and geographic location. User behavior metrics are also collected, including browsing history, click patterns, and time spent on pages, as well as different preferences like product selection, language preferences, and individual settings. Interactions, device information, transaction history, authentication data, communication logs, and various analytics and metrics contribute to the comprehensive range of user-centric information collected by websites. A method to systematically ingest and transfer such differently structured information to a central message broker is thoroughly described. In this context, a novel tool-Dataphos Publisher-for the creation of ready-to-digest data packages is presented. Data acquired from the message broker are employed for data quality analysis, storage, conversion, and downstream processing. A brief overview of the commonly used and freely available tools for data ingestion and processing is also provided.
引用
收藏
页数:23
相关论文
共 50 条
  • [41] Comparison of machine learning algorithms for evaluating building energy efficiency using big data analytics
    Egwim, Christian Nnaemeka
    Alaka, Hafiz
    Egunjobi, Oluwapelumi Oluwaseun
    Gomes, Alvaro
    Mporas, Iosif
    JOURNAL OF ENGINEERING DESIGN AND TECHNOLOGY, 2024, 22 (04) : 1325 - 1350
  • [42] Abstraction of meso-scale network architecture in granular ensembles using 'big data analytics' tools
    Kishore, Raj
    Krishnan, R.
    Satpathy, Manoranjan
    Nussinov, Zohar
    Sahu, Kisor K.
    JOURNAL OF PHYSICS COMMUNICATIONS, 2018, 2 (03):
  • [43] Improving big data analytics data processing speed through map reduce scheduling and replica placement with HDFS using genetic optimization techniques
    Sundara Kumar M.R.
    Mohan H.S.
    Journal of Intelligent and Fuzzy Systems, 2024, 46 (04) : 10863 - 10882
  • [44] A user configurable data acquisition and signal processing system for high-rate, high channel count applications
    Salim, Arwa
    Crockett, Louise
    McLean, John
    Milne, Peter
    FUSION ENGINEERING AND DESIGN, 2012, 87 (12) : 2174 - 2177
  • [45] Role of big data analytics and information processing capabilities in enhancing transparency and accountability in e-procurement applications
    Alnuaimi, Nasser Tuwali
    CHatha, Kamran Ali
    Abdallah, Salam
    JOURNAL OF ENGINEERING DESIGN AND TECHNOLOGY, 2024,
  • [46] Achieving data security and privacy across healthcare applications using cyber security mechanisms
    Zhu, Shanying
    Saravanan, Vijayalakshmi
    Muthu, BalaAnand
    ELECTRONIC LIBRARY, 2020, 38 (5-6) : 979 - 995
  • [47] Using Natural Language Processing and Data Mining for Forecasting Consumer Spending Through Social Media
    Mostafa, Noha
    Abdelazim, Kholoud
    Grida, Mohamed
    INTELLIGENT SYSTEMS AND APPLICATIONS, VOL 4, INTELLISYS 2023, 2024, 825 : 882 - 901
  • [48] ACCELERATING BIG DATA PROCESSING CHAIN IN IMAGE INFORMATION MINING USING A HYBRID HPC APPROACH
    Kurte, Kuldeep R.
    Bhangale, Ujwala M.
    Durbha, Surya S.
    King, Roger L.
    Younan, Nicolas H.
    2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, : 7597 - 7600
  • [49] Big Data Approaches for the Analysis of Large-Scale fMRI Data Using Apache Spark and GPU Processing: A Demonstration on Resting-State fMRI Data from the Human Connectome Project
    Boubela, Roland N.
    Kalcher, Klaudius
    Huf, Wolfgang
    Nasel, Christian
    Moser, Ewald
    FRONTIERS IN NEUROSCIENCE, 2016, 9
  • [50] Foundation Treatment in Urban Underground Engineering Using Big Data Analysis for Smart City Applications
    Liu, Fei
    Zhang, Yunkai
    Zhang, Jian
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2022, 132 (01): : 153 - 172