Estimating PM2.5 concentrations via random forest method using satellite, auxiliary, and ground-level station dataset at multiple temporal scales across China in 2017

被引:79
作者
Guo, Bin [1 ]
Zhang, Dingming [1 ]
Pei, Lin [2 ]
Su, Yi [1 ]
Wang, Xiaoxia [1 ]
Bian, Yi [1 ]
Zhang, Donghai [1 ]
Yao, Wanqiang [1 ]
Zhou, Zixiang [1 ]
Guo, Liyu [1 ]
机构
[1] Xian Univ Sci & Technol, Coll Geomat, Xian, Peoples R China
[2] Xi An Jiao Tong Univ, Sch Publ Hlth, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
PM2.5; Machine learning; Multiple data sources; Cross-validation; Mapping; GEOGRAPHICALLY WEIGHTED REGRESSION; FINE PARTICULATE MATTER; NIGHTTIME LIGHT IMAGERY; AEROSOL OPTICAL DEPTH; LAND-USE REGRESSION; LONG-TERM EXPOSURE; AIR-POLLUTION; GLOBAL BURDEN; ORGANIC-CARBON; DISEASE;
D O I
10.1016/j.scitotenv.2021.146288
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Fine particulate matter with aerodynamic diameters less than 2.5 mu m (PM2.5) poses adverse impacts on public health and the environment. It is still a great challenge to estimate high-resolution PM2.5 concentrations at moderate scales. The current study calibrated PM2.5 concentrations at a 1 km resolution scale using ground-level monitoring data, Aerosol Optical Depth (AOD), meteorological data, and auxiliary data via Random Forest (RF) model across China in 2017. The three ten-folded cross-validations (CV) methods including sample-based, time-based, and spatial-based validation combined with Coefficient Square (R-2), Root-Mean-Square Error (RMSE), and Mean Predictive Error (MPE) have been used for validation at different temporal scales in terms of daily, monthly, heating seasonal, and non-heating seasonal. Finally, the distribution map of PM2.5 concentrations was illustrated based on the RF model. Some findings were achieved. The RF model performed well, with a relatively high sample-based cross-validation R-2 of 0.74, a low RMSE of 16.29 mu g x m(-3), and a small MPE of -0.282 mu g x m(-3). Meanwhile, the performance of the RF model in inferring the PM2.5 concentrations was well at urban scales except for Chengyu (CY). North China, the CY urban agglomeration, and the northwest of China exhibited relatively high PM2.5 pollution features, especially in the heating season. The robustness of the RF model in the present study outperformed most statistical regression models for calibrating PM2.5 concentrations. The outcomes can supply an up-to-date scientific dataset for epidemiological and air pollutants exposure risk studies across China. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:14
相关论文
共 94 条
[1]   A Geographically and Temporally Weighted Regression Model for Ground-Level PM2.5 Estimation from Satellite-Derived 500 m Resolution AOD [J].
Bai, Yang ;
Wu, Lixin ;
Qin, Kai ;
Zhang, Yufeng ;
Shen, Yangyang ;
Zhou, Yuan .
REMOTE SENSING, 2016, 8 (03)
[2]   Particulate Air Pollution, Ambulatory Heart Rate Variability, and Cardiac Arrhythmia in Retirement Community Residents with Coronary Artery Disease [J].
Bartell, Scott M. ;
Longhurst, John ;
Tjoa, Thomas ;
Sioutas, Constantinos ;
Delfino, Ralph J. .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2013, 121 (10) :1135-1141
[3]   Critical review of chemometric indicators commonly used for assessing the quality of the prediction of soil attributes by NIR spectroscopy [J].
Bellon-Maurel, Veronique ;
Fernandez-Ahumada, Elvira ;
Palagos, Bernard ;
Roger, Jean-Michel ;
McBratney, Alex .
TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2010, 29 (09) :1073-1081
[4]   Exposure Assessment for Estimation of the Global Burden of Disease Attributable to Outdoor Air Pollution [J].
Brauer, Michael ;
Amann, Markus ;
Burnett, Rick T. ;
Cohen, Aaron ;
Dentener, Frank ;
Ezzati, Majid ;
Henderson, Sarah B. ;
Krzyzanowski, Michal ;
Martin, Randall V. ;
Van Dingenen, Rita ;
van Donkelaar, Aaron ;
Thurston, George D. .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2012, 46 (02) :652-660
[5]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[6]   Extreme gradient boosting model to estimate PM2.5 concentrations with missing-filled satellite data in China [J].
Chen, Zhao-Yue ;
Zhang, Tian-Hao ;
Zhang, Rong ;
Zhu, Zhong-Min ;
Yang, Jun ;
Chen, Ping-Yan ;
Ou, Chun-Quan ;
Guo, Yuming .
ATMOSPHERIC ENVIRONMENT, 2019, 202 :180-189
[8]   PM2.5 mapping using integrated geographically temporally weighted regression (GTWR) and random sample consensus (RANSAC) models [J].
Chu, Hone-Jay ;
Bilal, Muhammad .
ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH, 2019, 26 (02) :1902-1910
[9]  
Cohen AJ, 2017, LANCET, V389, P1907, DOI [10.1016/S0140-6736(17)30505-6, 10.1016/s0140-6736(17)30505-6]
[10]  
Cox LA, 2013, RISK ANAL, V33, P2111, DOI [10.1111/risa.12155, 10.1111/risa.12084]