Improving Air Quality Prediction via Self-Supervision Masked Air Modeling

被引:1
作者
Chen, Shuang [1 ]
He, Li [2 ]
Shen, Shinan [1 ]
Zhang, Yan [1 ,3 ,4 ]
Ma, Weichun [1 ,3 ,4 ,5 ]
机构
[1] Fudan Univ, Dept Environm Sci & Engn, Shanghai 200438, Peoples R China
[2] Peking Univ, Shenzhen Grad Sch, Environm & Energy, Shenzhen 518055, Peoples R China
[3] Fudan Univ, Shanghai Key Lab Atmospher Particle Pollut & Preve, Shanghai 200433, Peoples R China
[4] Shanghai Key Lab Policy Simulat & Assessment Ecol, Shanghai 200433, Peoples R China
[5] Inst Ecochongming IEC, 3663 Northern Zhongshan Rd, Shanghai 200062, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
air quality prediction; deep learning; self-supervised learning; Transformer; O3; PARTICULATE MATTER; NEURAL-NETWORK; GLOBAL BURDEN; OZONE; CHINA; PM2.5; POLLUTION; PM10; SIMULATIONS; MORTALITY;
D O I
10.3390/atmos15070856
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Presently, the harm to human health created by air pollution has greatly drawn public attention, in particular, vehicle emissions including nitrogen oxides as well as particulate matter. How to predict air quality, e.g., pollutant concentration, efficiently and accurately is a core problem in environmental research. Developing a robust air quality predictive model has become an increasingly important task, holding practical significance in the formulation of effective control policies. Recently, deep learning has progressed significantly in air quality prediction. In this paper, we go one step further and present a neat scheme of masked autoencoders, termed as masked air modeling (MAM), for sequence data self-supervised learning, which addresses the challenges posed by missing data. Specifically, the front end of our pipeline integrates a WRF-CAMx numerical model, which can simulate the process of emission, diffusion, transformation, and removal of pollutants based on atmospheric physics and chemical reactions. Then, the predicted results of WRF-CAMx are concatenated into a time series, and fed into an asymmetric Transformer-based encoder-decoder architecture for pre-training via random masking. Finally, we fine-tune an additional regression network, based on the pre-trained encoder, to predict ozone (O 3) concentration. Coupling these two designs enables us to consider the atmospheric physics and chemical reactions of pollutants while inheriting the long-range dependency modeling capabilities of the Transformer. The experimental results indicated that our approach effectively enhances the WRF-CAMx model's predictive capabilities and outperforms pure supervised network solutions. Overall, using advanced self-supervision approaches, our work provides a novel perspective for further improving air quality forecasting, which allows us to increase the smartness and resilience of the air prediction systems. This is due to the fact that accurate prediction of air pollutant concentrations is essential for detecting pollution events and implementing effective response strategies, thereby promoting environmentally sustainable development.
引用
收藏
页数:14
相关论文
共 54 条
[1]   An Estimate of the Global Burden of Anthropogenic Ozone and Fine Particulate Matter on Premature Human Mortality Using Atmospheric Modeling [J].
Anenberg, Susan C. ;
Horowitz, Larry W. ;
Tong, Daniel Q. ;
West, J. Jason .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2010, 118 (09) :1189-1195
[2]  
Athira V., 2018, Procedia Computer Science, V132, P1394, DOI 10.1016/j.procs.2018.05.068
[3]   Simulations of Organic Aerosol with CAMx over the Po Valley during the Summer Season [J].
Basla, Barbara ;
Agresti, Valentina ;
Balzarini, Alessandra ;
Giani, Paolo ;
Pirovano, Guido ;
Gilardoni, Stefania ;
Paglione, Marco ;
Colombi, Cristina ;
Belis, Claudio A. A. ;
Poluzzi, Vanes ;
Scotto, Fabiana ;
Lonati, Giovanni .
ATMOSPHERE, 2022, 13 (12)
[4]   Co-effect assessment on regional air quality: A perspective of policies and measures with greenhouse gas reduction potential [J].
Chen, Wanqi ;
Tang, Haoyue ;
He, Li ;
Zhang, Yan ;
Ma, Weichun .
SCIENCE OF THE TOTAL ENVIRONMENT, 2022, 851
[5]   Health benefits of decreases in on-road transportation emissions in the United States from 2008 to 2017 [J].
Choma, Ernani F. ;
Evans, John S. ;
Gomez-Ibanez, Jose A. ;
Di, Qian ;
Schwartz, Joel D. ;
Hammitt, James K. ;
Spengler, John D. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (51)
[6]  
Cohen AJ, 2017, LANCET, V389, P1907, DOI [10.1016/S0140-6736(17)30505-6, 10.1016/s0140-6736(17)30505-6]
[7]   Air pollution prediction via multi-label classification [J].
Corani, Giorgio ;
Scanagatta, Mauro .
ENVIRONMENTAL MODELLING & SOFTWARE, 2016, 80 :259-264
[8]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[9]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[10]   PM10 concentration forecasting in the metropolitan area of Oviedo (Northern Spain) using models based on SVM, MLP, VARMA and ARIMA: A case study [J].
Garcia Nieto, P. J. ;
Sanchez Lasheras, F. ;
Garcia-Gonzalo, E. ;
de Cos Juez, F. J. .
SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 621 :753-761