A survey on outlier detection methods applied on air quality data

被引:0
|
作者
Stroia-Vlad, Iuliana-Andreea [1 ]
Danciu, Gabriel Mihail [1 ]
机构
[1] Transilvania Univ Brasov, Dept Elect & Comp, Brasov, Romania
关键词
air pollution; time series; statistics; machine learning; regression;
D O I
10.1109/isetc50328.2020.9301140
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a study on the impact of various time series prediction algorithms applied on air quality data. This data is obtained from several sensors measurements, at every passing minute. The current research is concerned about finding a solution for a prediction algorithm based on fit functions. Traditional statistics models such as ARIMA (AutoRegressive Integrated Moving Average Model) and modern ones, like Facebook Prophet, were used for a comparative approach. Moreover, our proposed method has also been tested using different types of regression: Linear, Polynomial and Spline. After having made all the possible analogies between the selected algorithms for the given time series, regression spline has been found as the most accurate model. The purpose of this paper is to explain and to convince that results behave in a different manner depending on the used algorithm. The research has been done by studying air quality measurements received from various sensors, such as: PM2.5, PM1, PM10, O-3, CH2O, temperature, pressure and CO2. The study analyses sensors' values over a period of several months, obtaining over 43000 measurements per month for each sensor. The paper discusses the data obtained and its accuracy is tested using various metrics of evaluation.
引用
收藏
页码:23 / 26
页数:4
相关论文
共 50 条
  • [41] Unsupervised outlier detection for time-series data of indoor air quality using LSTM autoencoder with ensemble method
    Park, Junhyeok
    Seo, Youngsuk
    Cho, Jaehyuk
    JOURNAL OF BIG DATA, 2023, 10 (01)
  • [42] A Survey of Outlier Detection Methodologies and Their Applications
    Niu, Zhixian
    Shi, Shuping
    Sun, Jingyu
    He, Xiu
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT I, 2011, 7002 : 380 - 387
  • [43] Unsupervised outlier detection for time-series data of indoor air quality using LSTM autoencoder with ensemble method
    Junhyeok Park
    Youngsuk Seo
    Jaehyuk Cho
    Journal of Big Data, 10
  • [44] Outlier detection strategies for WSNs: A survey
    Chander, Bhanu
    Kumaravelan, G.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (08) : 5684 - 5707
  • [45] QualESTIM: Interactive Quality Assessment of Socioeconomic Data using Outlier Detection
    Plumejeaud, Christine
    Villanova-Oliver, Marlene
    BRIDGING THE GEOGRAPHIC INFORMATION SCIENCES, 2012, : 143 - 160
  • [46] Progress in Outlier Detection Techniques: A Survey
    Wang, Hongzhi
    Bah, Mohamed Jaward
    Hammad, Mohamed
    IEEE ACCESS, 2019, 7 : 107964 - 108000
  • [47] Outlier detection in interval data
    A. Pedro Duarte Silva
    Peter Filzmoser
    Paula Brito
    Advances in Data Analysis and Classification, 2018, 12 : 785 - 822
  • [48] Outlier detection in astronomical data
    Zhang, YX
    Luo, A
    Zhao, YH
    OPTIMIZING SCIENTIFIC RETURN FOR ASTRONOMY THROUGH INFORMATION TECHNOLOGIES, 2004, 5493 : 521 - 529
  • [49] Air Big Data Outlier Detection Based on Infinite Gauss Bayesian and CNN
    Zhou, LiangQi
    Xu, HongZhen
    Wei, Li
    Zhang, Quan
    Zhou, Fei
    Li, ZhuoPei
    ICMLC 2019: 2019 11TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2019, : 317 - 321
  • [50] The BACON-EEM algorithm for multivariate outlier detection in incomplete survey data
    Beguin, Cedric
    Hulliger, Beat
    SURVEY METHODOLOGY, 2008, 34 (01) : 91 - 103