A survey on outlier detection methods applied on air quality data

被引:0
|
作者
Stroia-Vlad, Iuliana-Andreea [1 ]
Danciu, Gabriel Mihail [1 ]
机构
[1] Transilvania Univ Brasov, Dept Elect & Comp, Brasov, Romania
关键词
air pollution; time series; statistics; machine learning; regression;
D O I
10.1109/isetc50328.2020.9301140
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper presents a study on the impact of various time series prediction algorithms applied on air quality data. This data is obtained from several sensors measurements, at every passing minute. The current research is concerned about finding a solution for a prediction algorithm based on fit functions. Traditional statistics models such as ARIMA (AutoRegressive Integrated Moving Average Model) and modern ones, like Facebook Prophet, were used for a comparative approach. Moreover, our proposed method has also been tested using different types of regression: Linear, Polynomial and Spline. After having made all the possible analogies between the selected algorithms for the given time series, regression spline has been found as the most accurate model. The purpose of this paper is to explain and to convince that results behave in a different manner depending on the used algorithm. The research has been done by studying air quality measurements received from various sensors, such as: PM2.5, PM1, PM10, O-3, CH2O, temperature, pressure and CO2. The study analyses sensors' values over a period of several months, obtaining over 43000 measurements per month for each sensor. The paper discusses the data obtained and its accuracy is tested using various metrics of evaluation.
引用
收藏
页码:23 / 26
页数:4
相关论文
共 50 条
  • [1] Outlier detection methods to improve the quality of citizen science data
    Jennifer S. Li
    Andreas Hamann
    Elisabeth Beaubien
    International Journal of Biometeorology, 2020, 64 : 1825 - 1833
  • [2] Outlier detection methods to improve the quality of citizen science data
    Li, Jennifer S.
    Hamann, Andreas
    Beaubien, Elisabeth
    INTERNATIONAL JOURNAL OF BIOMETEOROLOGY, 2020, 64 (11) : 1825 - 1833
  • [3] A survey on unsupervised subspace outlier detection methods for high dimensional data
    Ahn, Jaehyeong
    Kwon, Sunghoon
    KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (03) : 507 - 521
  • [4] Outlier Detection for Temporal Data: A Survey
    Gupta, Manish
    Gao, Jing
    Aggarwal, Charu C.
    Han, Jiawei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2250 - 2267
  • [5] A survey of machine learning methods applied to anomaly detection on drinking-water quality data
    Dogo, Eustace M.
    Nwulu, Nnamdi, I
    Twala, Bhekisipho
    Aigbavboa, Clinton
    URBAN WATER JOURNAL, 2019, 16 (03) : 235 - 248
  • [6] A Survey of Outlier Detection Algorithms for Data Streams
    Tamboli, Jinita
    Shukla, Madhu
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 3535 - 3540
  • [7] Outlier detection over data streams: Survey
    Brahmi Z.
    Souiden I.
    International Journal of Business Intelligence and Data Mining, 2021, 19 (04) : 481 - 507
  • [8] Discussion of Outlier Detection Methods of Purchasing Data
    Kono, Katsuya
    Yamamoto, Yoshiro
    2016 14TH INTERNATIONAL CONFERENCE ON ICT AND KNOWLEDGE ENGINEERING (ICT&KE), 2016, : 12 - 18
  • [9] A Survey of Outlier Detection Methods in Network Anomaly Identification
    Gogoi, Prasanta
    Bhattacharyya, D. K.
    Borah, B.
    Kalita, Jugal K.
    COMPUTER JOURNAL, 2011, 54 (04): : 570 - 588
  • [10] WMEVF: AN OUTLIER DETECTION METHODS FOR CATEGORICAL DATA
    Rokhman, Nur
    Subanar
    Winarko, Edi
    2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 37 - 42