Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia

被引:26
作者
Zaman, Nurul Amalin Fatihah Kamarul [1 ]
Kanniah, Kasturi Devi [1 ,2 ]
Kaskaoutis, Dimitris G. [3 ,4 ]
Latif, Mohd Talib [5 ]
机构
[1] Univ Teknol Malaysia, Fac Built Environm & Surveying, Trop Map Res Grp, Skudai 81310, Johor, Malaysia
[2] Univ Teknol Malaysia, Ctr Environm Sustainabil & Water Secur IPASA, Res Inst Sustainable Environm, Utm 81310, Johor, Malaysia
[3] Natl Observ Athens, Inst Environm Res & Sustainable Dev, Athens 15236, Greece
[4] Univ Crete, Dept Chem, Environm Chem Proc Lab, Iraklion 71003, Greece
[5] Univ Kebangsaan Malaysia, Fac Sci & Technol, Dept Earth Sci & Environm, Bangi 43600, Selangor, Malaysia
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 16期
关键词
PM2; 5; Himawari-8; random forest; support vector regression; air pollution; Malaysia; AEROSOL OPTICAL DEPTH; GROUND-LEVEL PM2.5; PARTICULATE MATTER; METEOROLOGICAL VARIABLES; NEXT-GENERATION; AIR-QUALITY; POLLUTANT CONCENTRATIONS; OZONE CONCENTRATIONS; PM10; CONCENTRATION; NEURAL-NETWORK;
D O I
10.3390/app11167326
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Southeast Asia (SEA) is a hotspot region for atmospheric pollution and haze conditions, due to extensive forest, agricultural and peat fires. This study aims to estimate the PM2.5 concentrations across Malaysia using machine-learning (ML) models like Random Forest (RF) and Support Vector Regression (SVR), based on satellite AOD (aerosol optical depth) observations, ground measured air pollutants (NO2, SO2, CO, O-3) and meteorological parameters (air temperature, relative humidity, wind speed and direction). The estimated PM2.5 concentrations for a two-year period (2018-2019) are evaluated against measurements performed at 65 air-quality monitoring stations located at urban, industrial, suburban and rural sites. PM2.5 concentrations varied widely between the stations, with higher values (mean of 24.2 +/- 21.6 mu g m(-3)) at urban/industrial stations and lower (mean of 21.3 +/- 18.4 mu g m(-3)) at suburban/rural sites. Furthermore, pronounced seasonal variability in PM2.5 is recorded across Malaysia, with highest concentrations during the dry season (June-September). Seven models were developed for PM2.5 predictions, i.e., separately for urban/industrial and suburban/rural sites, for the four dominant seasons (dry, wet and two inter-monsoon), and an overall model, which displayed accuracies in the order of R-2 = 0.46-0.76. The validation analysis reveals that the RF model (R-2 = 0.53-0.76) exhibits slightly better performance than SVR, except for the overall model. This is the first study conducted in Malaysia for PM2.5 estimations at a national scale combining satellite aerosol retrievals with ground-based pollutants, meteorological factors and ML techniques. The satisfactory prediction of PM2.5 concentrations across Malaysia allows a continuous monitoring of the pollution levels at remote areas with absence of measurement networks.
引用
收藏
页数:24
相关论文
共 157 条
  • [1] Forecasting Particulate Matter Concentration Using Linear and Non-Linear Approaches for Air Quality Decision Support
    Abdullah, Samsuri
    Ismail, Marzuki
    Ahmed, Ali Najah
    Abdullah, Ahmad Makmom
    [J]. ATMOSPHERE, 2019, 10 (11)
  • [2] Estimating fine particulate concentration using a combined approach of linear regression and artificial neural network
    Ahmad, Maqbool
    Alam, Khan
    Tariq, Shahina
    Anwar, Sajid
    Nasir, Jawad
    Mansha, Muhammad
    [J]. ATMOSPHERIC ENVIRONMENT, 2019, 219
  • [3] Calibration Model of a Low-Cost Air Quality Sensor Using an Adaptive Neuro-Fuzzy Inference System
    Alhasa, Kemal Maulana
    Nadzir, Mohd Shahrul Mohd
    Olalekan, Popoola
    Latif, Mohd Talib
    Yusup, Yusri
    Faruque, Mohammad Rashed Iqbal
    Ahamad, Fatimah
    Abd Hamid, Haris Hafizal
    Aiyub, Kadaruddin
    Ali, Sawal Hamid Md
    Khan, Md Firoz
    Abu Samah, Azizan
    Yusuff, Imran
    Othman, Murnira
    Hassim, Tengku Mohd Farid Tengku
    Ezani, Nor Eliani
    [J]. SENSORS, 2018, 18 (12)
  • [4] Seasonal variability of PM2.5 composition and sources in the Klang Valley urban-industrial environment
    Amil, Norhaniza
    Latif, Mohd Talib
    Khan, Md Firoz
    Mohamad, Maznorizan
    [J]. ATMOSPHERIC CHEMISTRY AND PHYSICS, 2016, 16 (08) : 5357 - 5381
  • [5] [Anonymous], 2018, WHO Global Ambient Air Quality Database (Update 2018)
  • [6] Spatiotemporal Variations and Contributing Factors of Air Pollutant Concentrations in Malaysia during Movement Control Order due to Pandemic COVID-19
    Ash'aari, Zulfa Hanan
    Aris, Ahmad Zaharin
    Ezani, Eliani
    Kamal, Nurfatin Izzati Ahmad
    Jaafar, Norlin
    Jahaya, Jasrul Nizam
    Manan, Shamsuri Abdul
    Saifuddin, Muhamad Firdaus Umar
    [J]. AEROSOL AND AIR QUALITY RESEARCH, 2020, 20 (10) : 2047 - 2061
  • [7] Awad M., 2015, Efficient learning machines: Theories, concepts, and applications for engineers and system designers p, P67, DOI [10.1007/978-1-4302-5990-94, DOI 10.1007/978-1-4302-5990-9_4]
  • [8] Awang M B, 2000, Respirology, V5, P183, DOI 10.1046/j.1440-1843.2000.00248.x
  • [9] A Geographically and Temporally Weighted Regression Model for Ground-Level PM2.5 Estimation from Satellite-Derived 500 m Resolution AOD
    Bai, Yang
    Wu, Lixin
    Qin, Kai
    Zhang, Yufeng
    Shen, Yangyang
    Zhou, Yuan
    [J]. REMOTE SENSING, 2016, 8 (03)
  • [10] Estimating urban PM10 and PM2.5 concentrations, based on synergistic MERIS/AATSR aerosol observations, land cover and morphology data
    Beloconi, Anton
    Kamarianakis, Yiannis
    Chrysoulakis, Nektarios
    [J]. REMOTE SENSING OF ENVIRONMENT, 2016, 172 : 148 - 164