Towards Smart Data Selection From Time Series Using Statistical Methods

Cited by: 2
Authors
Gil, Amaia [1, 2]
Quartulli, Marco [1]
Olaizola, Igor G. [1]
Sierra, Basilio [2]
Affiliations
[1] Vicomtech Fdn, Basque Res & Technol Alliance BRTA, Donostia San Sebastian 20009, Spain
[2] Univ Basque Country UPV EHU, Dept Comp Sci & Artificial Intelligence, Donostia San Sebastian 20018, Spain
Keywords
Data selection; machine learning; optimization; time series; TIME-SERIES; M4;
DOI
10.1109/ACCESS.2021.3066686
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Transmitting and storing the large volumes of time series data collected by modern sensors can be a significant technological challenge. One way to mitigate this challenge is to select a subset of significant data points, reducing data volume without sacrificing the quality of the subsequent analysis. This paper proposes a method for adaptively identifying the optimal data point selection algorithm for sensor time series on a window-by-window basis. The contribution therefore focuses on quantifying the effect of applying data selection algorithms to time series windows. The proposed approach is first applied to multiple synthetic time series, each obtained by concatenating several sources one after the other, and then validated on the entire UCR time series public data archive.
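The window-by-window idea in the abstract can be illustrated with a minimal sketch. The abstract does not name the candidate selection algorithms or the quality measure, so everything below is an assumption made for illustration: two hypothetical selectors (`select_uniform`, `select_largest_change`), linear interpolation as the reconstruction step, and RMSE as the score. It is not the authors' implementation.

```python
import numpy as np

def select_uniform(window, k):
    """Hypothetical selector: keep about k evenly spaced indices, endpoints included."""
    return np.unique(np.linspace(0, len(window) - 1, k).round().astype(int))

def select_largest_change(window, k):
    """Hypothetical selector: keep both endpoints plus the interior points
    with the largest absolute second difference."""
    curvature = np.abs(np.diff(window, n=2))  # entry i belongs to interior point i + 1
    interior = np.argsort(curvature)[::-1][: max(k - 2, 0)] + 1
    return np.unique(np.concatenate(([0], interior, [len(window) - 1])))

def reconstruction_rmse(window, kept):
    """RMSE between the window and its linear interpolation from the kept points
    (an assumed quality measure, not taken from the paper)."""
    t = np.arange(len(window))
    approx = np.interp(t, kept, window[kept])
    return float(np.sqrt(np.mean((window - approx) ** 2)))

SELECTORS = {"uniform": select_uniform, "largest_change": select_largest_change}

def best_selector_per_window(series, window_size, k):
    """For each non-overlapping window, return (selector name, RMSE) of the
    candidate with the lowest reconstruction error."""
    series = np.asarray(series, dtype=float)
    results = []
    for start in range(0, len(series) - window_size + 1, window_size):
        window = series[start : start + window_size]
        scores = {
            name: reconstruction_rmse(window, selector(window, k))
            for name, selector in SELECTORS.items()
        }
        best = min(scores, key=scores.get)
        results.append((best, scores[best]))
    return results
```

Calling `best_selector_per_window(series, window_size=16, k=5)` on a series built by concatenating different sources returns one `(name, rmse)` pair per window, so the preferred selector can change as the signal changes.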
Pages: 44390-44401
Number of pages: 12