Classification prediction model of indoor PM2.5 concentration using CatBoost algorithm

被引:3
作者
Guo, Zhenwei [1 ,2 ]
Wang, Xinyu [1 ]
Ge, Liang [1 ]
机构
[1] Chinese Soc Urban Studies, Beijing, Peoples R China
[2] Natl Engn Res Ctr Bldg Technol, Beijing, Peoples R China
关键词
indoor environment; PM2.5; limit; CatBoost model; classification prediction; machine learning; ARTIFICIAL NEURAL-NETWORKS; PARTICULATE MATTER; ENVIRONMENT; REGRESSION; QUALITY; PRODUCTIVITY; POLLUTION; SYSTEMS; SENSOR; IAQ;
D O I
10.3389/fbuil.2023.1207193
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
It is increasingly important to create a healthier indoor environment for office buildings. Accurate and reliable prediction of PM2.5 concentration can effectively alleviate the delay problem of indoor air quality control system. The rapid development of machine learning has provided a research basis for the indoor air quality system to control the PM2.5 concentration. One approach is to introduce the CatBoost algorithm based on rank lifting training into the classification and prediction of indoor PM2.5 concentration. Using actual monitoring data from office building, we consider previous indoor PM2.5 concentration, indoor temperature, relative humidity, CO2 concentration, and illumination as input variables, with the output indicating whether indoor PM2.5 concentration exceeds 25 mu g/m(3). Based on the CatBoost algorithm, we construct an intelligent classification prediction model for indoor PM2.5 concentration. The model is evaluated using actual data and compared with the multilayer perceptron (MLP), gradientboosting decision tree (GBDT), logistic regression (LR), decision tree (DT), and k-nearest neighbors (KNN) models. The CatBoost algorithm demonstrates outstanding predictive performance, achieving an impressive area under the ROC curve (AUC) of 0.949 after hyperparameters optimition. Furthermore, when considering the five input variables, the feature importance is ranked as follows: previous indoor PM2.5 concentration, relative humidity, CO2, indoor temperature, and illuminance. Through verification, the prediction model based on CatBoost algorithm can accurately predict the indoor PM2.5 concentration level. The model can be used to predict whether the indoor concentration of PM2.5 exceeds the standard in advance and guide the air quality control system to regulate.
引用
收藏
页数:10
相关论文
共 76 条
[1]   Integrating Statistical Machine Learning in a Semantic Sensor Web for Proactive Monitoring and Control [J].
Adeleke, Jude Adekunle ;
Moodley, Deshendran ;
Rens, Gavin ;
Adewumi, Aderemi Oluyinka .
SENSORS, 2017, 17 (04)
[2]  
Aditya, 2018, 2018 INT C ADV COMPU
[3]   Indoor Air Quality Analysis Using Deep Learning with Sensor Data [J].
Ahn, Jaehyun ;
Shin, Dongil ;
Kim, Kyuho ;
Yang, Jihoon .
SENSORS, 2017, 17 (11)
[4]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[5]   Occupant productivity and office indoor environment quality: A review of the literature [J].
Al Horr, Yousef ;
Arif, Mohammed ;
Kaushik, Amit ;
Mazroei, Ahmed ;
Katafygiotou, Martha ;
Elsarrag, Esam .
BUILDING AND ENVIRONMENT, 2016, 105 :369-389
[6]  
Amy Isabella S., 2022, NeuroImage Clin, V35, P103134, DOI [10.1016/j.nicl.2022.103134, DOI 10.1016/J.NICL.2022.103134]
[7]   Reassessment of the lethal London fog of 1952: Novel indicators of acute and chronic consequences of acute exposure to air pollution [J].
Bell, ML ;
Davis, DL .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2001, 109 :389-394
[8]  
Carlos G., 2018, ARPN J. Eng. Appl. Sci, V13, p2702
[9]  
Cheng C., 2019, Comput. Appl. Softw, V36, P7, DOI [10.3969/j.issn.1000-386x.2019.05.041, DOI 10.3969/J.ISSN.1000-386X.2019.05.041]
[10]   Impact of outdoor meteorology on indoor PM10, PM2.5 and PM1 concentrations in a naturally ventilated classroom [J].
Chithra, V. S. ;
Nagendra, S. M. Shiva .
URBAN CLIMATE, 2014, 10 :77-91