A Comparison of Machine Learning-Based Approaches in Estimating Surface PM2.5 Concentrations Focusing on Artificial Neural Networks and High Pollution Events

被引:3
作者
Wei, Shijin [1 ]
Shores, Kyle [2 ]
Xu, Yangyang [1 ]
机构
[1] Texas A&M Univ, Dept Atmospher Sci, College Stn, TX 77840 USA
[2] Natl Ctr Atmospher Res, Boulder, CO 80305 USA
关键词
machine learning; air quality; artificial neural network; MERRA-2; reanalysis; high pollution events; AEROSOL OPTICAL DEPTH; GROUND-LEVEL PM2.5; PARTICULATE MATTER; AIR-POLLUTION; SATELLITE; MODIS; CHINA;
D O I
10.3390/atmos16010048
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Surface PM2.5 concentrations have significant implications for human health, necessitating accurate estimations. This study compares various machine learning models, including linear models, tree-based algorithms, and artificial neural networks (ANNs) for estimating PM2.5 concentrations using the MERRA-2 dataset from 2012 to 2023. Mutual information and Spearman cross-feature correlation scores are used during feature selections. The performance of models is evaluated using metrics including normalized Nash-Sutcliffe efficiency (NNSE), root mean standard deviation ratio (RSR), and mean percentage error (MPE). Our results show that ANNs outperform linear and tree models, particularly in estimating daily PM2.5 concentrations of 35-1000 mu g/m3. ANNs improve NNSE by 119% and 46%, RSR by 40% and 24%, and MPE by 44% and 30% from linear and tree models, respectively, indicating ANN's superior estimation performance during high pollution days. The sensitivity analysis of features that interpret the models suggests that the total extinction AOD at 550 nm and surface CO concentrations are the most important features in the Western and Eastern U.S., respectively. The findings suggest that even the simplest NNs provide better air quality estimates, especially during high pollution events, which is beneficial for long-term exposure analysis. Future research should explore more sophisticated NN architectures with spatial and temporal variations in PM2.5 to improve the model performance.
引用
收藏
页数:35
相关论文
共 79 条
[71]   A Spatial-Temporal Interpretable Deep Learning Model for improving interpretability and predictive accuracy of satellite-based PM2.5 [J].
Yan, Xing ;
Zang, Zhou ;
Jiang, Yize ;
Shi, Wenzhong ;
Guo, Yushan ;
Li, Dan ;
Zhao, Chuanfeng ;
Husi, Letu .
ENVIRONMENTAL POLLUTION, 2021, 273
[72]   XGBoost-SHAP-based interpretable diagnostic framework for alzheimer's disease [J].
Yi, Fuliang ;
Yang, Hui ;
Chen, Durong ;
Qin, Yao ;
Han, Hongjuan ;
Cui, Jing ;
Bai, Wenlin ;
Ma, Yifei ;
Zhang, Rong ;
Yu, Hongmei .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2023, 23 (01)
[73]   PM2.5 Modeling and Historical Reconstruction over the Continental USA Utilizing GOES-16 AOD [J].
Yu, Xiaohe ;
Lary, David J. ;
Simmons, Christopher S. .
REMOTE SENSING, 2021, 13 (23)
[74]   Development of a stacked ensemble model for forecasting and analyzing daily average PM2.5 concentrations in Beijing, China [J].
Zhai, Binxu ;
Chen, Jianguo .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 635 :644-658
[75]   Spatiotemporal prediction of continuous daily PM2.5 concentrations across China using a spatially explicit machine learning algorithm [J].
Zhan, Yu ;
Luo, Yuzhou ;
Deng, Xunfei ;
Chen, Huajin ;
Grieneisen, Michael L. ;
Shen, Xueyou ;
Zhu, Lizhong ;
Zhang, Minghua .
ATMOSPHERIC ENVIRONMENT, 2017, 155 :129-139
[76]   Fusing XGBoost and SHAP Models for Maritime Accident Prediction and Causality Interpretability Analysis [J].
Zhang, Cheng ;
Zou, Xiong ;
Lin, Chuan .
JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (08)
[77]   Spatial modeling of PM2.5 concentrations with a multifactoral radial basis function neural network [J].
Zou, Bin ;
Wang, Min ;
Wan, Neng ;
Wilson, J. Gaines ;
Fang, Xin ;
Tang, Yuqi .
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2015, 22 (14) :10395-10404
[78]   Regularization and variable selection via the elastic net [J].
Zou, H ;
Hastie, T .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2005, 67 :301-320
[79]   Comparison of Six Machine Learning Methods for Estimating PM2.5 Concentration Using the Himawari-8 Aerosol Optical Depth [J].
Zuo, Xin ;
Guo, Hong ;
Shi, Shuaiyi ;
Zhang, Xiaochuan .
JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2020, 48 (09) :1277-1287