SymED: Adaptive and Online Symbolic Representation of Data on the Edge

被引:1
作者
Hofstaetter, Daniel [1 ]
Ilager, Shashikant [1 ]
Lujic, Ivan [2 ]
Brandic, Ivona [1 ]
机构
[1] Vienna Univ Technol, Vienna, Austria
[2] Ericsson Nikola Tesla, Zagreb, Croatia
来源
EURO-PAR 2023: PARALLEL PROCESSING | 2023年 / 14100卷
基金
奥地利科学基金会;
关键词
Internet of Things; Edge computing; Symbolic data representation; Edge storage and analytics; Data compression; Time series; DATA-COMPRESSION;
D O I
10.1007/978-3-031-39698-4_28
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The edge computing paradigm helps handle the Internet of Things (IoT) generated data in proximity to its source. Challenges occur in transferring, storing, and processing this rapidly growing amount of data on resource-constrained edge devices. Symbolic Representation (SR) algorithms are promising solutions to reduce the data size by converting actual raw data into symbols. Also, they allow data analytics (e.g., anomaly detection and trend prediction) directly on symbols, benefiting large classes of edge applications. However, existing SR algorithms are centralized in design and work offline with batch data, which is infeasible for real-time cases. We propose SymED - Symbolic Edge Data representation method, i.e., an online, adaptive, and distributed approach for symbolic representation of data on edge. SymED is based on the Adaptive Brownian Bridge-based Aggregation (ABBA), where we assume low-powered IoT devices do initial data compression (senders) and the more robust edge devices do the symbolic conversion (receivers). We evaluate SymED by measuring compression performance, reconstruction accuracy through Dynamic Time Warping (DTW) distance, and computational latency. The results show that SymED is able to (i) reduce the raw data with an average compression rate of 9.5%; (ii) keep a low reconstruction error of 13.25 in the DTW space; (iii) simultaneously provide real-time adaptability for online streaming IoT data at typical latencies of 42ms per symbol, reducing the overall network traffic.
引用
收藏
页码:411 / 425
页数:15
相关论文
共 22 条
  • [1] Attig A., 2011, T CASE BASED REASON, V4, P3
  • [2] An energy efficient IoT data compression approach for edge machine learning
    Azar, Joseph
    Makhoul, Abdallah
    Barhamgi, Mahmoud
    Couturier, Raphael
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2019, 96 : 168 - 175
  • [3] Berndt J., 1994, P KDD WORKSH SEATTL, V10, P359, DOI DOI 10.5555/3000850.3000887
  • [4] Dau H.A., 2019, The UCR time series classification archive
  • [5] ABBA: adaptive Brownian bridge-based symbolic aggregation of time series
    Elsworth, Steven
    Guttel, Stefan
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2020, 34 (04) : 1175 - 1200
  • [6] Information Abstraction for Heterogeneous Real World Internet Data
    Ganz, Frieder
    Barnaghi, Payam
    Carrez, Francois
    [J]. IEEE SENSORS JOURNAL, 2013, 13 (10) : 3793 - 3805
  • [7] Gupta V., 2019, P 2019 3 INT C BIG D, P12
  • [8] Hofstatter D., 2023, ARTIFACT INSTRUCTION, DOI [10.6084/m9.figshare.23536992, DOI 10.6084/M9.FIGSHARE.23536992]
  • [9] Khan MA, 2014, 2014 NATIONAL SOFTWARE ENGINEERING CONFERENCE (NSEC - 2014), P61, DOI 10.1109/NSEC.2014.6998242
  • [10] A Knowledge-based Approach for Real-Time IoT Data Stream Annotation and Processing
    Kolozali, Sefki
    Bermudez-Edo, Maria
    Puschmann, Daniel
    Ganz, Frieder
    Barnaghi, Payam
    [J]. 2014 IEEE International Conference (iThings) - 2014 IEEE International Conference on Green Computing and Communications (GreenCom) - 2014 IEEE International Conference on Cyber-Physical-Social Computing (CPS), 2014, : 215 - 222