An efficient approach for low latency processing in stream data

被引:2
|
作者
Bhatt, Nirav [1 ]
Thakkar, Amit [2 ]
机构
[1] CHARUSAT, Chandubhai S Patel Inst Technol, Informat Technol, Anand, Gujarat, India
[2] CHARUSAT, Chandubhai S Patel Inst Technol, Comp Sci & Engn, Anand, Gujarat, India
关键词
Data stream; Stream processing; Latency; SYSTEMS; GOLD; OIL;
D O I
10.7717/peerj-cs.426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stream data is the data that is generated continuously from the different data sources and ideally defined as the data that has no discrete beginning or end. Processing the stream data is a part of big data analytics that aims at querying the continuously arriving data and extracting meaningful information from the stream. Although earlier processing of such stream was using batch analytics, nowadays there are applications like the stock market, patient monitoring, and traffic analysis which can cause a drastic difference in processing, if the output is generated in levels of hours and minutes. The primary goal of any real-time stream processing system is to process the stream data as soon as it arrives. Correspondingly, analytics of the stream data also needs consideration of surrounding dependent data. For example, stock market analytics results are often useless if we do not consider their associated or dependent parameters which affect the result. In a real-world application, these dependent stream data usually arrive from the distributed environment. Hence, the stream processing system has to be designed, which can deal with the delay in the arrival of such data from distributed sources. We have designed the stream processing model which can deal with all the possible latency and provide an end-to-end low latency system. We have performed the stock market prediction by considering affecting parameters, such as USD, OIL Price, and Gold Price with an equal arrival rate. We have calculated the Normalized Root Mean Square Error (NRMSE) which simplifies the comparison among models with different scales. A comparative analysis of the experiment presented in the report shows a significant improvement in the result when considering the affecting parameters. In this work, we have used the statistical approach to forecast the probability of possible data latency arrives from distributed sources. Moreover, we have performed preprocessing of stream data to ensure at-least-once delivery semantics. In the direction towards providing low latency in processing, we have also implemented exactly-once processing semantics. Extensive experiments have been performed with varying sizes of the window and data arrival rate. We have concluded that system latency can be reduced when the window size is equal to the data arrival rate.
引用
收藏
页码:1 / 19
页数:19
相关论文
共 50 条
  • [1] TurboStream: Towards Low-Latency Data Stream Processing
    Wu, Song
    Liu, Mi
    Ibrahim, Shadi
    Jin, Hai
    Gu, Lin
    Chen, Fei
    Liu, Zhiyi
    2018 IEEE 38TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS), 2018, : 983 - 993
  • [2] Keep calm and react with foresight: strategies for low-latency and energy-efficient elastic data stream processing
    De Matteis, Tiziano
    Mencagli, Gabriele
    ACM SIGPLAN NOTICES, 2016, 51 (08) : 149 - 160
  • [3] On Efficient Processing of Linked Stream Data
    Saleh, Omran
    Sattler, Kai-Uwe
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2014 CONFERENCES, 2014, 8841 : 700 - 717
  • [4] Efficient Stream Processing of Scientific Data
    Lindemann, Thomas
    Kauke, Jonas
    Teubner, Jens
    2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2018, : 140 - 145
  • [5] An Efficient Approach for Storage of Big Data Streams in Distributed Stream Processing Systems
    Alshamrani, Sultan
    Waseem, Quadri
    Alharbi, Abdullah
    Alosaimi, Wael
    Turabieh, Hamza
    Alyami, Hashem
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (05) : 91 - 98
  • [6] Efficient Processing of Semi-Stream Data
    Naeem, M. Asif
    2013 EIGHTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION MANAGEMENT (ICDIM), 2013, : 7 - 10
  • [7] TerseCades: Efficient Data Compression in Stream Processing
    Pekhimenko, Gennady
    Guo, Chuanxiong
    Jeon, Myeongjae
    Huang, Peng
    Zhou, Lidong
    PROCEEDINGS OF THE 2018 USENIX ANNUAL TECHNICAL CONFERENCE, 2018, : 307 - 320
  • [8] Efficient Digital Twin Data Processing for Low-Latency Multicast Short Video Streaming
    Huang, Xinyu
    Hu, Shisheng
    Li, Mushu
    Huang, Cheng
    Shen, Xuemin
    2024 IEEE/CIC INTERNATIONAL CONFERENCE ON COMMUNICATIONS IN CHINA, ICCC, 2024,
  • [9] Elastic Stream Processing with Latency Guarantees
    Lohrmann, Bjoern
    Janacik, Peter
    Kao, Odej
    2015 IEEE 35TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, 2015, : 399 - 410
  • [10] An Approach for Meteorological Data Integration and Stream Processing
    Hdafa, Mohamed
    Zouhairi, Youssef
    Lemoudden, Mouad
    Ziyati, Elhoussaine
    PROCEEDINGS OF 2016 THIRD INTERNATIONAL CONFERENCE ON SYSTEMS OF COLLABORATION (SYSCO), 2016, : P96 - P100