Evaluation of Machine Learning Models for Estimating PM2.5 Concentrations across Malaysia

被引:26
作者
Zaman, Nurul Amalin Fatihah Kamarul [1 ]
Kanniah, Kasturi Devi [1 ,2 ]
Kaskaoutis, Dimitris G. [3 ,4 ]
Latif, Mohd Talib [5 ]
机构
[1] Univ Teknol Malaysia, Fac Built Environm & Surveying, Trop Map Res Grp, Skudai 81310, Johor, Malaysia
[2] Univ Teknol Malaysia, Ctr Environm Sustainabil & Water Secur IPASA, Res Inst Sustainable Environm, Utm 81310, Johor, Malaysia
[3] Natl Observ Athens, Inst Environm Res & Sustainable Dev, Athens 15236, Greece
[4] Univ Crete, Dept Chem, Environm Chem Proc Lab, Iraklion 71003, Greece
[5] Univ Kebangsaan Malaysia, Fac Sci & Technol, Dept Earth Sci & Environm, Bangi 43600, Selangor, Malaysia
来源
APPLIED SCIENCES-BASEL | 2021年 / 11卷 / 16期
关键词
PM2; 5; Himawari-8; random forest; support vector regression; air pollution; Malaysia; AEROSOL OPTICAL DEPTH; GROUND-LEVEL PM2.5; PARTICULATE MATTER; METEOROLOGICAL VARIABLES; NEXT-GENERATION; AIR-QUALITY; POLLUTANT CONCENTRATIONS; OZONE CONCENTRATIONS; PM10; CONCENTRATION; NEURAL-NETWORK;
D O I
10.3390/app11167326
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Southeast Asia (SEA) is a hotspot region for atmospheric pollution and haze conditions, due to extensive forest, agricultural and peat fires. This study aims to estimate the PM2.5 concentrations across Malaysia using machine-learning (ML) models like Random Forest (RF) and Support Vector Regression (SVR), based on satellite AOD (aerosol optical depth) observations, ground measured air pollutants (NO2, SO2, CO, O-3) and meteorological parameters (air temperature, relative humidity, wind speed and direction). The estimated PM2.5 concentrations for a two-year period (2018-2019) are evaluated against measurements performed at 65 air-quality monitoring stations located at urban, industrial, suburban and rural sites. PM2.5 concentrations varied widely between the stations, with higher values (mean of 24.2 +/- 21.6 mu g m(-3)) at urban/industrial stations and lower (mean of 21.3 +/- 18.4 mu g m(-3)) at suburban/rural sites. Furthermore, pronounced seasonal variability in PM2.5 is recorded across Malaysia, with highest concentrations during the dry season (June-September). Seven models were developed for PM2.5 predictions, i.e., separately for urban/industrial and suburban/rural sites, for the four dominant seasons (dry, wet and two inter-monsoon), and an overall model, which displayed accuracies in the order of R-2 = 0.46-0.76. The validation analysis reveals that the RF model (R-2 = 0.53-0.76) exhibits slightly better performance than SVR, except for the overall model. This is the first study conducted in Malaysia for PM2.5 estimations at a national scale combining satellite aerosol retrievals with ground-based pollutants, meteorological factors and ML techniques. The satisfactory prediction of PM2.5 concentrations across Malaysia allows a continuous monitoring of the pollution levels at remote areas with absence of measurement networks.
引用
收藏
页数:24
相关论文
共 157 条
  • [41] Grivas G, 2017, SCI TOTAL ENVIRON, V581, P518, DOI [10.1016/j.sdtotenv.2016.12.159, 10.1016/j.scitotenv.2016.12.159]
  • [42] Estimating ground-level PM2.5 concentrations in Beijing using a satellite-based geographically and temporally weighted regression model
    Guo, Yuanxi
    Tang, Qiuhong
    Gong, Dao-Yi
    Zhang, Ziyin
    [J]. REMOTE SENSING OF ENVIRONMENT, 2017, 198 : 140 - 149
  • [43] Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: 2. A neural network approach
    Gupta, Pawan
    Christopher, Sundar A.
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2009, 114
  • [44] Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: Multiple regression approach
    Gupta, Pawan
    Christopher, Sundar A.
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2009, 114
  • [45] Extreme nitrogen oxide and ozone concentrations in athens atmosphere in relation to meteorological conditions
    Hatzianastassiou, Nikolaos
    Katsoulis, Basil D.
    Antakis, Basil
    [J]. ENVIRONMENTAL MONITORING AND ASSESSMENT, 2007, 128 (1-3) : 447 - 464
  • [46] Estimation of Surface Downward Shortwave Radiation over China from Himawari-8 AHI Data Based on Random Forest
    Hou, Ning
    Zhang, Xiaotong
    Zhang, Weiyu
    Wei, Yu
    Jia, Kun
    Yao, Yunjun
    Jiang, Bo
    Cheng, Jie
    [J]. REMOTE SENSING, 2020, 12 (01)
  • [47] Estimating PM2.5 Concentrations in the Conterminous United States Using the Random Forest Approach
    Hu, Xuefei
    Belle, Jessica H.
    Meng, Xia
    Wildani, Avani
    Waller, Lance A.
    Strickland, Matthew J.
    Liu, Yang
    [J]. ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2017, 51 (12) : 6936 - 6944
  • [48] Estimating ground-level PM2.5 concentrations in the Southeastern United States using MAIAC AOD retrievals and a two-stage model
    Hu, Xuefei
    Waller, Lance A.
    Lyapustin, Alexei
    Wang, Yujie
    Al-Hamdan, Mohammad Z.
    Crosson, William L.
    Estes, Maurice G., Jr.
    Estes, Sue M.
    Quattrochi, Dale A.
    Puttaswamy, Sweta Jinnagara
    Liu, Yang
    [J]. REMOTE SENSING OF ENVIRONMENT, 2014, 140 : 220 - 232
  • [49] Estimating ground-level PM2.5 concentrations in the southeastern US using geographically weighted regression
    Hu, Xuefei
    Waller, Lance A.
    Al-Hamdan, Mohammad Z.
    Crosson, William L.
    Estes, Maurice G., Jr.
    Estes, Sue M.
    Quattrochi, Dale A.
    Sarnat, Jeremy A.
    Liu, Yang
    [J]. ENVIRONMENTAL RESEARCH, 2013, 121 : 1 - 10
  • [50] Spatial analysis of MODIS aerosol optical depth, PM2.5, and chronic coronary heart disease
    Hu, Zhiyong
    [J]. INTERNATIONAL JOURNAL OF HEALTH GEOGRAPHICS, 2009, 8