Towards Smart Data Selection From Tithe Series Using Statistical Methods

被引:2
|
作者
Gil, Amaia [1 ,2 ]
Quartulli, Marco [1 ]
Olaizola, Igor G. [1 ]
Sierra, Basilio [2 ]
机构
[1] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Donostia San Sebastian 20009, Spain
[2] Univ Basque Country UPV EHU, Dept Comp Sci & Artificial Intelligence, Donostia San Sebastian 20018, Spain
关键词
Data selection; machine learning; optimization; time series; TIME-SERIES; M4;
D O I
10.1109/ACCESS.2021.3066686
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Transmitting and storing large volumes of dynamic / time series data collected by modern sensors can represent a significant technological challenge. A possibility to mitigate this challenge is to effectively select a subset of significant data points in order to reduce data volumes without sacrificing the quality of the results of the subsequent analysis. This paper proposes a method for adaptively identifying optimal data point selection algorithms for sensor time series on a window-by-window basis. Thus, this contribution focuses on quantifying the effect of the application of data selection algorithms to time series windows. The proposed approach is first used on multiple synthetically generated time series obtained by concatenating multiple sources one after the other, and then validated in the entire UCR time series public data archive.
引用
收藏
页码:44390 / 44401
页数:12
相关论文
共 50 条
  • [1] Extended statistical downscaling model using data mining methods for modeling hydrologic time series
    Shi, Yuqing
    Zhu, Yuelong
    Journal of Computational and Theoretical Nanoscience, 2015, 12 (12) : 5081 - 5088
  • [2] Variable selection for prediction of time series from smart city
    Macas, Martin
    2015 SMART CITIES SYMPOSIUM PRAGUE (SCSP), 2015,
  • [3] Modeling MODIS LAI time series using three statistical methods
    Jiang, Bo
    Liang, Shunlin
    Wang, Jindi
    Xiao, Zhiqiang
    REMOTE SENSING OF ENVIRONMENT, 2010, 114 (07) : 1432 - 1444
  • [4] Looking for dependencies in short time series using imprecise statistical data
    Hryniewicz, Olgierd
    Theoretical Advances and Applications of Fuzzy Logic and Soft Computing, 2007, 42 : 201 - 208
  • [5] Comparing Statistical and Machine Learning Methods for Time Series Forecasting in Data-Driven Logistics-A Simulation Study
    Schmid, Lena
    Roidl, Moritz
    Kirchheim, Alice
    Pauly, Markus
    ENTROPY, 2025, 27 (01)
  • [6] Survey of data-selection methods in statistical machine translation
    Eetemadi, Sauleh
    Lewis, William
    Toutanova, Kristina
    Radha, Hayder
    MACHINE TRANSLATION, 2015, 29 (3-4) : 189 - 223
  • [7] Selection of Temporal Lags for Predicting Riverflow Series from Hydroelectric Plants Using Variable Selection Methods
    Siqueira, Hugo
    Macedo, Mariana
    Tadano, Yara de Souza
    Alves, Thiago Antonini
    Stevan Jr, Sergio L.
    Oliveira Jr, Domingos S.
    Marinho, Manoel H. N.
    de Mattos Neto, Paulo S. G.
    de Oliveira, Joao F. L.
    Luna, Ivette
    Leone Filho, Marcos de Almeida
    Sarubbo, Leonie Asfora
    Converti, Attilio
    ENERGIES, 2020, 13 (16)
  • [8] Photovoltaic power forecasting using statistical methods: impact of weather data
    De Giorgi, Maria Grazia
    Congedo, Paolo Maria
    Malvoni, Maria
    IET SCIENCE MEASUREMENT & TECHNOLOGY, 2014, 8 (03) : 90 - 97
  • [9] Viable forecasting monthly weather data using time series methods
    Soomro, Ramzan
    Ghauri, Saghir Pervaiz
    Marri, Azhar Ali
    Vambol, Sergij
    Dung, Hoang Thi
    Manzoor, Nazish
    Bano, Shella
    Shahid, Sana
    Asadullah
    Farooq, Ahmed
    Lutsenko, Yurii
    ECOLOGICAL QUESTIONS, 2023, 34 (01)
  • [10] A study of time series forecasting using statistical methods, machine learning methods and deep learning: historical aspects
    Kitov, V. V.
    Mishustina, M., V
    Ustyuzhanin, A. O.
    VOPROSY ISTORII, 2022, 4 (02) : 201 - 218