A Gaussian Process Method with Uncertainty Quantification for Air Quality Monitoring

被引:6
作者
Wang, Peng [1 ]
Mihaylova, Lyudmila [2 ]
Chakraborty, Rohit [3 ]
Munir, Said [3 ]
Mayfield, Martin [3 ]
Alam, Khan [4 ]
Khokhar, Muhammad Fahim [5 ]
Zheng, Zhengkai [6 ]
Jiang, Chengxi [7 ]
Fang, Hui [8 ]
机构
[1] Manchester Metropolitan Univ, Dept Comp & Math, Manchester M15 6BH, Lancs, England
[2] Univ Sheffield, Dept Automat Control & Syst Engn, Sheffield S10 2TN, S Yorkshire, England
[3] Univ Sheffield, Dept Civil & Struct Engn, Sheffield S10 2TN, S Yorkshire, England
[4] Univ Peshawar, Dept Phys, Peshawar 25120, Pakistan
[5] Natl Univ Sci & Technol, Inst Environm Sci & Engn, Islamabad 44000, Pakistan
[6] Yueqing Xinshou Agr Dev Co Ltd, Yueqing 325604, Peoples R China
[7] Wenzhou Univ, Coll Elect & Elect Engn, Wenzhou 325035, Peoples R China
[8] Zhejiang Univ, Coll Biosyst Engn & Food Sci, Hangzhou 310058, Peoples R China
基金
英国工程与自然科学研究理事会; 美国国家科学基金会;
关键词
Gaussian process; uncertainty quantification; air quality forecasting; low-cost sensors; sustainable development; POLLUTION;
D O I
10.3390/atmos12101344
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The monitoring and forecasting of particulate matter (e.g., PM2.5) and gaseous pollutants (e.g., NO, NO2, and SO2) is of significant importance, as they have adverse impacts on human health. However, model performance can easily degrade due to data noises, environmental and other factors. This paper proposes a general solution to analyse how the noise level of measurements and hyperparameters of a Gaussian process model affect the prediction accuracy and uncertainty, with a comparative case study of atmospheric pollutant concentrations prediction in Sheffield, UK, and Peshawar, Pakistan. The Neumann series is exploited to approximate the matrix inverse involved in the Gaussian process approach. This enables us to derive a theoretical relationship between any independent variable (e.g., measurement noise level, hyperparameters of Gaussian process methods), and the uncertainty and accuracy prediction. In addition, it helps us to discover insights on how these independent variables affect the algorithm evidence lower bound. The theoretical results are verified by applying a Gaussian processes approach and its sparse variants to air quality data forecasting.
引用
收藏
页数:18
相关论文
共 26 条
[1]  
[Anonymous], 2017, IMPR AIR QUAL TACKL
[2]  
[Anonymous], 2006, Air Quality Guidelines: Global Update 2005: Particulate Matter, Ozone, Nitrogen Dioxide, and Sulfur Dioxide
[3]   Air Pollution Forecasts: An Overview [J].
Bai, Lu ;
Wang, Jianzhou ;
Ma, Xuejiao ;
Lu, Haiyan .
INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2018, 15 (04)
[4]  
Burt DR, 2019, PR MACH LEARN RES, V97
[5]   Living near major roads and the incidence of dementia, Parkinson's disease, and multiple sclerosis: a population-based cohort study [J].
Chen, Hong ;
Kwong, Jeffrey C. ;
Copes, Ray ;
Tu, Karen ;
Villeneuve, Paul J. ;
van Donkelaar, Aaron ;
Hystad, Perry ;
Martin, Randall V. ;
Murray, Brian J. ;
Jessiman, Barry ;
Wilton, Andrew S. ;
Kopp, Alexander ;
Burnett, Richard T. .
LANCET, 2017, 389 (10070) :718-726
[6]   How priors of initial hyperparameters affect Gaussian process regression models [J].
Chen, Zexun ;
Wang, Bo .
NEUROCOMPUTING, 2018, 275 :1702-1710
[7]  
Garrido A., 2018, As pescas em Portugal, P1
[8]   Full-chain health impact assessment of traffic-related air pollution and childhood asthma [J].
Khreis, Haneen ;
de Hoogh, Kees ;
Nieuwenhuijsen, Mark J. .
ENVIRONMENT INTERNATIONAL, 2018, 114 :365-375
[9]   Air pollution and health [J].
Landrigan, Philip J. .
LANCET PUBLIC HEALTH, 2017, 2 (01) :E4-E5
[10]   Forecasting concentrations of air pollutants by logarithm support vector regression with immune algorithms [J].
Lin, Kuo-Ping ;
Pai, Ping-Feng ;
Yang, Shun-Ling .
APPLIED MATHEMATICS AND COMPUTATION, 2011, 217 (12) :5318-5327