Towards Smart Data Selection From Time Series Using Statistical Methods

Cited by: 2
Authors
Gil, Amaia [1, 2]
Quartulli, Marco [1 ]
Olaizola, Igor G. [1 ]
Sierra, Basilio [2 ]
Affiliations
[1] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Donostia San Sebastian 20009, Spain
[2] Univ Basque Country UPV EHU, Dept Comp Sci & Artificial Intelligence, Donostia San Sebastian 20018, Spain
Keywords
Data selection; machine learning; optimization; time series; TIME-SERIES; M4;
DOI
10.1109/ACCESS.2021.3066686
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Transmitting and storing the large volumes of time series data collected by modern sensors can be a significant technological challenge. One way to mitigate it is to select a subset of significant data points, reducing data volume without sacrificing the quality of the subsequent analysis. This paper proposes a method for adaptively identifying the optimal data point selection algorithm for sensor time series on a window-by-window basis. The contribution therefore focuses on quantifying the effect of applying data selection algorithms to time series windows. The proposed approach is first applied to multiple synthetic time series, each obtained by concatenating several sources one after the other, and then validated on the entire UCR time series public data archive.
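As a rough illustration of what choosing a selection algorithm window by window can look like, the following Python sketch compares two candidate selectors on each window and keeps the one whose retained points reconstruct the window best. Everything in it is an assumption made for illustration and is not taken from the paper: the two selectors (`select_uniform`, `select_largest_change`), the linear-interpolation RMSE used as the quality measure, and the names `window_size` and `k` are all hypothetical.

```python
import numpy as np

def select_uniform(window, k):
    """Hypothetical selector: keep about k evenly spaced indices, endpoints included."""
    return np.unique(np.linspace(0, len(window) - 1, k).round().astype(int))

def select_largest_change(window, k):
    """Hypothetical selector: keep the endpoints plus the interior points
    with the largest absolute second difference."""
    n = len(window)
    curvature = np.abs(np.diff(window, n=2))  # one value per interior point 1..n-2
    interior = np.argsort(curvature)[::-1][: max(k - 2, 0)] + 1
    return np.unique(np.concatenate(([0, n - 1], interior)))

def reconstruction_rmse(window, idx):
    """Assumed quality measure: RMSE between the window and its linear
    interpolation through the retained points."""
    t = np.arange(len(window))
    approx = np.interp(t, idx, window[idx])
    return float(np.sqrt(np.mean((window - approx) ** 2)))

SELECTORS = {"uniform": select_uniform, "largest_change": select_largest_change}

def best_selector_per_window(series, window_size, k):
    """Return (window start, best selector name, its RMSE) for each
    non-overlapping window of the series."""
    series = np.asarray(series, dtype=float)
    results = []
    for start in range(0, len(series) - window_size + 1, window_size):
        window = series[start:start + window_size]
        scores = {name: reconstruction_rmse(window, fn(window, k))
                  for name, fn in SELECTORS.items()}
        best = min(scores, key=scores.get)
        results.append((start, best, scores[best]))
    return results

if __name__ == "__main__":
    # Synthetic series built by concatenating two sources, as the abstract describes.
    ramp = np.arange(16, dtype=float)
    spike = np.zeros(16)
    spike[7] = 1.0
    for start, name, rmse in best_selector_per_window(np.concatenate([ramp, spike]), 16, 5):
        print(start, name, round(rmse, 4))
```

In this sketch the preferred selector can change from one window to the next, which is the kind of behaviour a window-level selection scheme is meant to exploit; the actual candidate algorithms and evaluation criteria are those defined in the paper itself.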
Pages: 44390 - 44401
Number of pages: 12