Big-Data-Driven Machine Learning for Enhancing Spatiotemporal Air Pollution Pattern Analysis

被引:17
|
作者
Zareba, Mateusz [1 ]
Dlugosz, Hubert [1 ]
Danek, Tomasz [1 ]
Weglinska, Elzbieta [1 ]
机构
[1] AGH Univ Sci & Technol, Fac Geol Geophys & Environm Protect, Dept Geoinformat & Appl Comp Sci, PL-30059 Krakow, Poland
关键词
big data; machine learning; spatiotemporal; air pollution; pattern analysis; time series; SPATIAL ASSOCIATION;
D O I
10.3390/atmos14040760
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air pollution is an important problem for public health. The spatiotemporal analysis is a crucial step for understanding the complex characteristics of air pollution. Using many sensors and high-resolution time-step observations makes this task a big data challenge. In this study, unsupervised machine learning algorithms were applied to analyze spatiotemporal patterns of air pollution. The analysis was conducted using PM10 big data collected from almost 100 sensors located in Krakow, over a period of one year, with data being recorded at 1-h intervals. The analysis results using K-means and SKATER clustering revealed distinct differences between average and maximum values of pollutant concentrations. The study found that the K-means algorithm with Dynamic Time Warping (DTW) was more accurate in identifying yearly patterns and clustering in rapidly and spatially varying data, compared to the SKATER algorithm. Moreover, the clustering analysis of data after kriging greatly facilitated the interpretation of the results. These findings highlight the potential of machine learning techniques and big data analysis for identifying hot-spots, coldspots, and patterns of air pollution and informing policy decisions related to urban planning, traffic management, and public health interventions.
引用
收藏
页数:25
相关论文
共 50 条
  • [41] A Survey of Machine Learning Methods for Big Data
    Ruiz, Zoila
    Salvador, Jaime
    Garcia-Rodriguez, Jose
    BIOMEDICAL APPLICATIONS BASED ON NATURAL AND ARTIFICIAL COMPUTING, PT II, 2017, 10338 : 259 - 267
  • [42] Editorial: Big data and machine learning in sociology
    Leitgoeb, Heinz
    Prandner, Dimitri
    Wolbring, Tobias
    FRONTIERS IN SOCIOLOGY, 2023, 8
  • [43] Machine learning techniques for explaining air pollution prediction
    Kusy, Maciej
    Kowalski, Piotr A.
    Szwagrzyk, Marcin
    Konior, Aleksander
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [44] Machine learning on big data: Opportunities and challenges
    Zhou, Lina
    Pan, Shimei
    Wang, Jianwu
    Vasilakos, Athanasios V.
    NEUROCOMPUTING, 2017, 237 : 350 - 361
  • [45] Big data and machine learning for crop protection
    Ip, Ryan H. L.
    Ang, Li-Minn
    Seng, Kah Phooi
    Broster, J. C.
    Pratley, J. E.
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2018, 151 : 376 - 383
  • [46] Green Computing for Big Data and Machine Learning
    Barua, Hrishav Bakul
    Mondal, Kartick Chandra
    Khatua, Sunirmal
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022, : 348 - 351
  • [47] Machine learning for Big Data analytics in plants
    Ma, Chuang
    Zhang, Hao Helen
    Wang, Xiangfeng
    TRENDS IN PLANT SCIENCE, 2014, 19 (12) : 798 - 808
  • [48] SDN/NFV, Machine Learning, and Big Data Driven Network Slicing for 5G
    Le, Luong-Vy
    Lin, Bao-Shuh Paul
    Tung, Li-Ping
    Sinh, Do
    2018 IEEE 5G WORLD FORUM (5GWF), 2018, : 20 - 25
  • [49] Machine Learning With Big Data: Challenges and Approaches
    L'Heureux, Alexandra
    Grolinger, Katarina
    Elyamany, Hany F.
    Capretz, Miriam A. M.
    IEEE ACCESS, 2017, 5 : 7776 - 7797
  • [50] Spatiotemporal Deep Learning Model for Citywide Air Pollution Interpolation and Prediction
    Le, Van-Duc
    Bui, Tien-Cuong
    Cha, Sang-Kyun
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020, : 55 - 62