Mapping of high-resolution daily particulate matter (PM2.5) concentration at the city level through a machine learning-based downscaling approach

Cited by: 0
Authors
Nguyen, Phuong D. M. [1]
Phan, An H. [1]
Ngo, Truong X. [1]
Ho, Bang Q. [2]
Pham, Tran Vu [3]
Nguyen, Thanh T. N. [1]
Affiliations
[1] Vietnam Natl Univ Hanoi, Univ Engn & Technol, Fac Informat Technol, E3 Bldg,144 Xuan Thuy St,Dich Vong Hau Ward, Hanoi 100000, Vietnam
[2] Vietnam Natl Univ, Dept Acad Affairs, 142 Hien Thanh St,Dist 10, Ho Chi Minh City 700000, Vietnam
[3] Ho Chi Minh City Univ Technol HCMUT, Fac Comp Sci & Engn, VNU HCM, Ho Chi Minh City 700000, Vietnam
Keywords
PM2.5; Downscaling; Machine learning; Deep learning; Ho Chi Minh City;
DOI
10.1007/s10661-024-13562-6
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
PM2.5 pollution is a major global concern, particularly in Vietnam, because of its harmful effects on health and the environment. Monitoring local PM2.5 levels is crucial for assessing air quality. However, Vietnam's state-of-the-art (SOTA) dataset, at 3 km resolution, is too coarse to depict spatial variation within smaller regions accurately. In this research, we investigated machine learning-based downscaling methods to improve the spatial resolution and quality of Vietnam's existing 3 km PM2.5 products using two groups of approaches: traditional machine learning models (random forest, XGBoost, CatBoost, support vector regression (SVR), mixed effect model (MEM)) and deep learning models (long short-term memory (LSTM), convolutional neural network (CNN), convolutional LSTM (ConvLSTM)). Overall, the CatBoost 2-day lag model performed best. In terms of modeling, integrating temporal factors into tree-based models can enhance predictive accuracy. Furthermore, on small datasets, traditional machine learning models outperform complex deep learning approaches. Validating machine and deep learning models against the PM2.5 maps they generate is necessary, because a model can score very well in model evaluation yet produce maps that are unrealistic in application. In this study, compared to the SOTA PM2.5 maps in Vietnam and the SOTA global maps, the maps from the proposed CatBoost 2-day lag model showed a 57% increase in the correlation coefficient (Pearson R), as well as 42-73%, 28-75%, and 39-75% reductions in root mean squared error (RMSE), mean relative error (MRE), and mean absolute error (MAE), respectively. Additionally, the daily, monthly, and annual-average maps generated by the CatBoost 2-day lag model effectively capture the spatial distribution and seasonal variations of PM2.5 in Ho Chi Minh City. These findings indicate a substantial enhancement in the accuracy and reliability of downscaled PM2.5 maps.
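As a rough illustration of two ideas named in the abstract, the sketch below builds 2-day lag features from a daily series and computes the four reported metrics. It is not the authors' implementation: the function names are hypothetical, the toy values are invented for illustration, and the MRE formula (mean of |predicted - observed| / observed) is an assumed definition that the abstract does not spell out.

```python
from math import sqrt


def add_lag_features(series, n_lags=2):
    """Pair each day's value with the values of the preceding n_lags days.

    Returns a list of (lags, target) tuples, where lags[0] is day t-1
    and lags[1] is day t-2. The first n_lags days are skipped because
    they lack a full history.
    """
    rows = []
    for t in range(n_lags, len(series)):
        lags = [series[t - k] for k in range(1, n_lags + 1)]
        rows.append((lags, series[t]))
    return rows


def rmse(obs, pred):
    """Root mean squared error."""
    return sqrt(sum((p - o) ** 2 for o, p in zip(obs, pred)) / len(obs))


def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(p - o) for o, p in zip(obs, pred)) / len(obs)


def mre(obs, pred):
    """Mean relative error (assumed definition: mean of |p - o| / o)."""
    return sum(abs(p - o) / o for o, p in zip(obs, pred)) / len(obs)


def pearson_r(obs, pred):
    """Pearson correlation coefficient between observations and predictions."""
    mean_o = sum(obs) / len(obs)
    mean_p = sum(pred) / len(pred)
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(obs, pred))
    var_o = sum((o - mean_o) ** 2 for o in obs)
    var_p = sum((p - mean_p) ** 2 for p in pred)
    return cov / sqrt(var_o * var_p)


# Toy daily PM2.5 values (invented, for illustration only).
daily_pm25 = [10, 12, 15, 11]
lagged_rows = add_lag_features(daily_pm25, n_lags=2)
```

In a tree-based setup, each lags list would be appended to the other predictors of that day before fitting, which is one simple way to give a model without built-in memory access to recent history.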
Pages: 22