PM2.5 Prediction Based on Random Forest, XGBoost, and Deep Learning Using Multisource Remote Sensing Data

被引:355
作者
Joharestani, Mehdi Zamani [1 ,2 ]
Cao, Chunxiang [1 ,2 ]
Ni, Xiliang [1 ,2 ]
Bashir, Barjeece [1 ,2 ]
Talebiesfandarani, Somayeh [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Remote Sensing & Digital Earth, State Key Lab Remote Sensing Sci, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
PM25; prediction; XGBoost; random forest; deep leaning; feature importance; PARTICULATE AIR-POLLUTION; AEROSOL OPTICAL DEPTH; GROUND-LEVEL PM2.5; RISK-ASSESSMENT; NEURAL-NETWORK; HEALTH IMPACT; TEHRAN; QUALITY; MORTALITY; MATTER;
D O I
10.3390/atmos10070373
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In recent years, air pollution has become an important public health concern. The high concentration of fine particulate matter with diameter less than 2.5 mu m (PM2.5) is known to be associated with lung cancer, cardiovascular disease, respiratory disease, and metabolic disease. Predicting PM2.5 concentrations can help governments warn people at high risk, thus mitigating the complications. Although attempts have been made to predict PM2.5 concentrations, the factors influencing PM2.5 prediction have not been investigated. In this work, we study feature importance for PM2.5 prediction in Tehran's urban area, implementing random forest, extreme gradient boosting, and deep learning machine learning (ML) approaches. We use 23 features, including satellite and meteorological data, ground-measured PM2.5, and geographical data, in the modeling. The best model performance obtained was R-2 = 0.81 (R = 0.9), MAE = 9.93 mu g/m(3), and RMSE = 13.58 mu g/m(3) using the XGBoost approach, incorporating elimination of unimportant features. However, all three ML methods performed similarly and R-2 varied from 0.63 to 0.67, when Aerosol Optical Depth (AOD) at 3 km resolution was included, and 0.77 to 0.81, when AOD at 3 km resolution was excluded. Contrary to the PM2.5 lag data, satellite-derived AODs did not improve model performance.
引用
收藏
页数:19
相关论文
共 54 条
[1]   Global air quality and pollution [J].
Akimoto, H .
SCIENCE, 2003, 302 (5651) :1716-1719
[2]   Seasonal variations in the oxidative stress and inflammatory potential of PM2.5 in Tehran using an alveolar macrophage model; The role of chemical composition and sources [J].
Al Hanai, Ahlam H. ;
Antkiewicz, Dagmara S. ;
Hemming, Jocelyn D. C. ;
Shafer, Martin M. ;
Lai, Alexandra M. ;
Arhami, Mohammad ;
Hosseini, Vahid ;
Schauer, James J. .
ENVIRONMENT INTERNATIONAL, 2019, 123 :417-427
[3]  
[Anonymous], UN DESA WORLD URB PR
[4]  
[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[5]  
[Anonymous], 2019, J. Air Pollut. Health.
[6]  
[Anonymous], 2017, OCCUPATIONAL ENV HLT
[7]   Meteorological correlates and AirQ+ health risk assessment of ambient fine particulate matter in Tehran, Iran [J].
Ansari, Mohsen ;
Ehrampoush, Mohammad Hassan .
ENVIRONMENTAL RESEARCH, 2019, 170 :141-150
[8]   Seasonal trends in the composition and sources of PM2.5 and carbonaceous aerosol in Tehran, Iran [J].
Arhami, Mohammad ;
Shahne, Maryam Zare ;
Hosseini, Vahid ;
Haghighat, Navid Roufigar ;
Lai, Alexandra M. ;
Schauer, James J. .
ENVIRONMENTAL POLLUTION, 2018, 239 :69-81
[9]   Seasonal trends, chemical speciation and source apportionment of fine PM in Tehran [J].
Arhami, Mohammad ;
Hosseini, Vahid ;
Shahne, Maryam Zare ;
Bigdeli, Mostafa ;
Lai, Alexandra ;
Schauer, James J. .
ATMOSPHERIC ENVIRONMENT, 2017, 153 :70-82
[10]  
Bengio Y, 2013, INT CONF ACOUST SPEE, P8624, DOI 10.1109/ICASSP.2013.6639349