Big-Data-Driven Machine Learning for Enhancing Spatiotemporal Air Pollution Pattern Analysis

被引:17
|
作者
Zareba, Mateusz [1 ]
Dlugosz, Hubert [1 ]
Danek, Tomasz [1 ]
Weglinska, Elzbieta [1 ]
机构
[1] AGH Univ Sci & Technol, Fac Geol Geophys & Environm Protect, Dept Geoinformat & Appl Comp Sci, PL-30059 Krakow, Poland
关键词
big data; machine learning; spatiotemporal; air pollution; pattern analysis; time series; SPATIAL ASSOCIATION;
D O I
10.3390/atmos14040760
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air pollution is an important problem for public health. The spatiotemporal analysis is a crucial step for understanding the complex characteristics of air pollution. Using many sensors and high-resolution time-step observations makes this task a big data challenge. In this study, unsupervised machine learning algorithms were applied to analyze spatiotemporal patterns of air pollution. The analysis was conducted using PM10 big data collected from almost 100 sensors located in Krakow, over a period of one year, with data being recorded at 1-h intervals. The analysis results using K-means and SKATER clustering revealed distinct differences between average and maximum values of pollutant concentrations. The study found that the K-means algorithm with Dynamic Time Warping (DTW) was more accurate in identifying yearly patterns and clustering in rapidly and spatially varying data, compared to the SKATER algorithm. Moreover, the clustering analysis of data after kriging greatly facilitated the interpretation of the results. These findings highlight the potential of machine learning techniques and big data analysis for identifying hot-spots, coldspots, and patterns of air pollution and informing policy decisions related to urban planning, traffic management, and public health interventions.
引用
收藏
页数:25
相关论文
共 50 条
  • [21] Big Data Analysis of TV Dramas Based on Machine Learning
    Tan, Jiaqi
    Mao, Feiqiao
    Yang, Lianghai
    Wang, Jiahui
    SMART COMPUTING AND COMMUNICATION, SMARTCOM 2017, 2018, 10699 : 90 - 95
  • [22] Tension in big data using machine learning: Analysis and applications
    Wang, Huamao
    Yao, Yumei
    Salhi, Said
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2020, 158
  • [23] Big data and machine learning driven bioprocessing-Recent trends and critical analysis
    Yang, Chao-Tung
    Kristiani, Endah
    Leong, Yoong Kit
    Chang, Jo-Shu
    BIORESOURCE TECHNOLOGY, 2023, 372
  • [24] Big-data-driven Anomaly Detection in Industry (4.0): an approach and a case study
    Stojanovic, Ljiljana
    Dinic, Marko
    Stojanovic, Nenad
    Stojadinovic, Aleksandar
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 1647 - 1652
  • [25] An Open Framework for Dynamic Big-Data-Driven Application Systems (DBDDAS) Development
    Douglas, Craig C.
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, 2014, 29 : 1246 - 1255
  • [26] Machine Learning Aided Air Traffic Flow Analysis Based on Aviation Big Data
    Gui, Guan
    Zhou, Ziqi
    Wang, Juan
    Liu, Fan
    Sun, Jinlong
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (05) : 4817 - 4826
  • [27] Machine Learning under Big Data
    Shi, Chunhe
    Wu, Chengdong
    Han, Xiaowei
    Xie, Yinghong
    Li, Zhen
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ELECTRONIC, MECHANICAL, INFORMATION AND MANAGEMENT SOCIETY (EMIM), 2016, 40 : 301 - 305
  • [28] Air Pollution Modelling by Machine Learning Methods
    Vidnerova, Petra
    Neruda, Roman
    MODELLING, 2021, 2 (04): : 659 - 674
  • [29] Big data driven order-up-to level model: Application of machine learning
    Clausen, Johan Bjerre Bach
    Li, Hongyan
    COMPUTERS & OPERATIONS RESEARCH, 2022, 139
  • [30] Pollution and Weather Reports: Using Machine Learning for Combating Pollution in Big Cities
    Popa, Cicerone Laurentiu
    Dobrescu, Tiberiu Gabriel
    Silvestru, Catalin-Ionut
    Firulescu, Alexandru-Cristian
    Popescu, Constantin Adrian
    Cotet, Costel Emil
    SENSORS, 2021, 21 (21)