Predicting intraurban PM2.5 concentrations using enhanced machine learning approaches and incorporating human activity patterns

被引:22
作者
Ashayeri, Mehdi [1 ,3 ]
Abbasabadi, Narjes [1 ,4 ]
Heidarinejad, Mohammad [2 ]
Stephens, Brent [2 ]
机构
[1] IIT, Coll Architecture, Chicago, IL 60616 USA
[2] IIT, Dept Civil Architectural & Environm Engn, Alumni Mem Hall Room 228E,3201 South Dearborn St, Chicago, IL 60616 USA
[3] Southern Illinois Univ, Sch Architecture, Coll Hlth & Human Sci, Carbondale, IL USA
[4] Univ Texas Arlington, Coll Architecture Planning & Publ Affairs, Arlington, TX 76019 USA
关键词
Outdoor air quality; Artificial intelligence; Human activity; Air pollution modeling; Statistical prediction model; LAND-USE REGRESSION; URBAN AIR-POLLUTION; PARTICULATE MATTER; UNITED-STATES; METEOROLOGICAL VARIABLES; POPULATION EXPOSURE; SPATIAL VARIABILITY; MODEL PREDICTIONS; AMBIENT PM2.5; BLACK CARBON;
D O I
10.1016/j.envres.2020.110423
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Urban areas contribute substantially to human exposure to ambient air pollution. Numerous statistical prediction models have been used to estimate ambient concentrations of fine particulate matter (PM2.5) and other pollutants in urban environments, with some incorporating machine learning (ML) algorithms to improve predictive power. However, many ML approaches for predicting ambient pollutant concentrations to date have used principal component analysis (PCA) with traditional regression algorithms to explore linear correlations between variables and to reduce the dimensionality of the data. Moreover, while most urban air quality prediction models have traditionally incorporated explanatory variables such as meteorological, land use, transportation/mobility, and/ or co-pollutant factors, recent research has shown that local emissions from building infrastructure may also be useful factors to consider in estimating urban pollutant concentrations. Here we propose an enhanced ML approach for predicting urban ambient PM2.5 concentrations that hybridizes cascade and PCA methods to reduce the dimensionality of the data-space and explore nonlinear effects between variables. We test the approach using different durations of time series air quality datasets of hourly PM2.5 concentrations from three air quality monitoring sites in different urban neighborhoods in Chicago, IL to explore the influence of dynamic humanrelated factors, including mobility (i.e., traffic) and building occupancy patterns, on model performance. We test 9 state-of-the-art ML algorithms to find the most effective algorithm for modeling intraurban PM2.5 variations and we explore the relative importance of all sets of factors on intraurban air quality model performance. Results demonstrate that Gaussian-kernel support vector regression (SVR) was the most effective ML algorithm tested, improving accuracy by 118% compared to a traditional multiple linear regression (MLR) approach. Incorporating the enhanced approach with SVR algorithm increased model performance up to 18.4% for yearlong and 98.7% for month-long hourly datasets, respectively. Incorporating assumptions for human occupancy patterns in dominant building typologies resulted in improvements in model performance by between 4% and 37%. Combined, these innovations can be used to improve the performance and accuracy of urban air quality prediction models compared to conventional approaches.
引用
收藏
页数:19
相关论文
共 118 条
[1]   An integrated data-driven framework for urban energy use modeling (UEUM) [J].
Abbasabadi, Narjes ;
Ashayeri, Mehdi ;
Azari, Rahman ;
Stephens, Brent ;
Heidarinejad, Mohammad .
APPLIED ENERGY, 2019, 253
[2]   Urban energy use modeling methods and tools: A review and an outlook [J].
Abbasabadi, Narjes ;
Ashayeri, J. K. Mehdi .
BUILDING AND ENVIRONMENT, 2019, 161
[3]  
Allison P. D., 1999, Multiple regression: A primer
[4]   Global Air Quality and Health Co-benefits of Mitigating Near-Term Climate Change through Methane and Black Carbon Emission Controls [J].
Anenberg, Susan C. ;
Schwartz, Joel ;
Shindell, Drew ;
Amann, Markus ;
Faluvegi, Greg ;
Klimont, Zbigniew ;
Janssens-Maenhout, Greet ;
Pozzoli, Luca ;
Van Dingenen, Rita ;
Vignati, Elisabetta ;
Emberson, Lisa ;
Muller, Nicholas Z. ;
West, J. Jason ;
Williams, Martin ;
Demkine, Volodymyr ;
Hicks, W. Kevin ;
Kuylenstierna, Johan ;
Raes, Frank ;
Ramanathan, Veerabhadran .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2012, 120 (06) :831-839
[5]  
[Anonymous], 2009, Integrated Science Assessment for Particulate Matter
[6]  
[Anonymous], 2022, Modern Applied Statistics with S
[7]  
[Anonymous], 2003, PRACTICAL GUIDE SUPP
[8]  
Atkinson Beth, 2018, **DATA OBJECT**
[9]   Pilot study of the vertical variations in outdoor pollutant concentrations and environmental conditions along the height of a tall building [J].
Azimi, Parham ;
Zhao, Haoran ;
Fazli, Torkan ;
Zhao, Dan ;
Faramarzi, Afshin ;
Leung, Luke ;
Stephens, Brent .
BUILDING AND ENVIRONMENT, 2018, 138 :124-134
[10]   Planning for sustainable cities by estimating building occupancy with mobile phones [J].
Barbour, Edward ;
Davila, Carlos Cerezo ;
Gupta, Siddharth ;
Reinhart, Christoph ;
Kaur, Jasleen ;
Gonzalez, Marta C. .
NATURE COMMUNICATIONS, 2019, 10 (1)