Evaluation and Predicting PM10 Concentration Using Multiple Linear Regression and Machine Learning

Cited by: 13
Authors
Son, Sanghun [1, 2]
Kim, Jinsoo [3]
Affiliations
[1] Pukyong Natl Univ, Div Earth Environm Syst Sci, Busan, South Korea
[2] Pukyong Natl Univ, Spatial Informat Engn, Busan, South Korea
[3] Pukyong Natl Univ, Dept Spatial Informat Engn, Busan, South Korea
Keywords
PM10; concentration; Meteorological Variables; Multiple Linear Regression; Support Vector Machine; Random Forest; PARTICULATE MATTER; PM2.5;
DOI
10.7780/kjrs.2020.36.6.3.7
Chinese Library Classification number
TP7 [Remote sensing technology];
Subject classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Particulate matter (PM) that has been artificially generated during the recent period of rapid industrialization and urbanization moves and disperses according to weather conditions, and adversely affects the human skin and respiratory systems. The purpose of this study is to predict the PM10 concentration in Seoul using meteorological factors as the input dataset for multiple linear regression (MLR), support vector machine (SVM), and random forest (RF) models, and to compare and evaluate the performance of the models. First, the PM10 concentration data obtained at 39 air quality monitoring sites (AQMS) in Seoul were divided into training and validation datasets (8:2 ratio). Nine meteorological factors (mean, maximum, and minimum temperature, precipitation, average and maximum wind speed, wind direction, yellow dust, and relative humidity), obtained by the automatic weather system (AWS), formed the input dataset for the models. The coefficients of determination (R²) between the observed PM10 concentration and that predicted by the MLR, SVM, and RF models were 0.260, 0.772, and 0.793, respectively; the RF model predicted the PM10 concentration best. Among the AQMS used for model validation, the Gwanak-gu and Gangnam-daero AQMS are relatively close to the AWS, and the SVM and RF models were highly accurate there. The Jongno-gu AQMS is relatively far from the AWS, but because PM10 concentrations from the two adjacent AQMS were used for model training, both models still achieved high accuracy. By contrast, the Yongsan-gu AQMS is relatively far from both the other AQMS and the AWS, and both models performed poorly there.
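The abstract names two evaluation steps, an 8:2 training/validation split and the coefficient of determination between observed and predicted PM10. The following is a minimal Python sketch of both, not the authors' code. The helper names `split_8_2` and `r_squared`, the random shuffling, and the sample values are assumptions made for illustration. The record does not say how the data were split or which definition of R² was used; the sketch uses the common form 1 - SS_res / SS_tot.

```python
import random


def split_8_2(records, seed=0):
    """Shuffle records and split them 80% training / 20% validation.

    Hypothetical helper: the source gives only the 8:2 ratio,
    not the splitting procedure.
    """
    shuffled = list(records)
    random.Random(seed).shuffle(shuffled)
    cut = len(shuffled) * 8 // 10
    return shuffled[:cut], shuffled[cut:]


def r_squared(observed, predicted):
    """Coefficient of determination, defined here as 1 - SS_res / SS_tot."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Invented PM10 values (micrograms per cubic metre), for illustration only.
observed = [30.0, 45.0, 60.0, 80.0, 55.0]
predicted = [35.0, 40.0, 65.0, 70.0, 50.0]
score = r_squared(observed, predicted)

# Split 100 placeholder record indices into training and validation sets.
train, valid = split_8_2(range(100))
```

Under this definition, a model that always predicts the observed mean scores 0 and a perfect model scores 1, which gives a scale for reading the reported 0.260 (MLR) against 0.772 (SVM) and 0.793 (RF).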
Pages: 1711-1720
Number of pages: 10
Related papers
31 items in total
[1]   Development of Multiple Linear Regression for Particulate Matter (PM10) Forecasting during Episodic Transboundary Haze Event in Malaysia [J].
Abdullah, Samsuri ;
Napi, Nur Nazmi Liyana Mohd ;
Ahmed, Ali Najah ;
Mansor, Wan Nurdiyana Wan ;
Abu Mansor, Amalina ;
Ismail, Marzuki ;
Abdullah, Ahmad Makmom ;
Ramly, Zamzam Tuah Ahmad .
ATMOSPHERE, 2020, 11 (03)
[2]   PM10 Prediction Model by Support Vector Regression Based on Particle Swarm Optimization [J].
Arampongsanuwat, Saowalak ;
Meesad, Phayung .
MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 :3693-+
[3]  
Bozdag A, 2020, ENVIRON POLLUT, V263, P1
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis [J].
Chen, Hui-Ling ;
Yang, Bo ;
Liu, Jie ;
Liu, Da-You .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :9014-9022
[6]  
Choubin B., 2020, SCI TOTAL ENVIRON, V701, P1
[7]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[8]   A hybrid ARIMA and artificial neural networks model to forecast particulate matter in urban areas: The case of Temuco, Chile [J].
Diaz-Robles, Luis A. ;
Ortega, Juan C. ;
Fu, Joshua S. ;
Reed, Gregory D. ;
Chow, Judith C. ;
Watson, John G. ;
Moncada-Herrera, Juan A. .
ATMOSPHERIC ENVIRONMENT, 2008, 42 (35) :8331-8340
[9]   Random forest meteorological normalisation models for Swiss PM10 trend analysis [J].
Grange, Stuart K. ;
Carslaw, David C. ;
Lewis, Alastair C. ;
Boleti, Eirini ;
Hueglin, Christoph .
ATMOSPHERIC CHEMISTRY AND PHYSICS, 2018, 18 (09) :6223-6239
[10]  
Han J, 2008, J KOREAN SOC ATMOS E, V24, P300