Maximum likelihood cost functions for neural network models of air quality data

被引:48
作者
Dorling, SR
Foxall, RJ
Mandic, DP
Cawley, GC [1 ]
机构
[1] Univ E Anglia, Sch Informat Syst, Norwich NR4 7TJ, Norfolk, England
[2] Univ E Anglia, Sch Environm Sci, Norwich NR4 7TJ, Norfolk, England
[3] Univ London Imperial Coll Sci Technol & Med, Dept Elect & Elect Engn, London SW7 2BT, England
关键词
neural network; ozone; modelling exceedences;
D O I
10.1016/S1352-2310(03)00323-6
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The prediction of episodes of poor air quality using artificial neural networks is investigated, concentrating on selection of the most appropriate cost function used in training. Different cost functions correspond to different distributional assumptions regarding the data, the appropriate choice depends on whether a forecast of absolute pollutant concentration or prediction of exceedence events is of principle importance. The cost functions investigated correspond to logistic regression, homoscedastic Gaussian (i.e. conventional sum-of-squares) regression and heteroscedastic Gaussian regression. Both linear and nonlinear neural network architectures are evaluated. While the results presented relate to a dataset describing the daily time-series of the concentration of surface level ozone (O-3) in urban Berlin, the methods applied are quite general and applicable to a wide range of pollutants and locations. The heteroscedastic Gaussian regression model outperforms the other nonlinear methods investigated; however, there is little improvement resulting from the use of nonlinear rather than linear models. Of greater significance is the flexibility afforded by the nonlinear heteroscedastic Gaussian regression model for a range of potential end-users, who may all have different answers to the question: "What is more important, correctly predicting exceedences or avoiding false alarms?". (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:3435 / 3443
页数:9
相关论文
共 22 条
[1]   Comparing classifiers when the misallocation costs are uncertain [J].
Adams, NM ;
Hand, DJ .
PATTERN RECOGNITION, 1999, 32 (07) :1139-1147
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[4]   The use of the area under the roc curve in the evaluation of machine learning algorithms [J].
Bradley, AP .
PATTERN RECOGNITION, 1997, 30 (07) :1145-1159
[5]   Comparing neural networks and regression models for ozone forecasting [J].
Comrie, AC .
JOURNAL OF THE AIR & WASTE MANAGEMENT ASSOCIATION, 1997, 47 (06) :653-663
[6]  
Foxall R, 2001, INT CONF ACOUST SPEE, P3505, DOI 10.1109/ICASSP.2001.940597
[7]   Artificial neural networks (the multilayer perceptron) - A review of applications in the atmospheric sciences [J].
Gardner, MW ;
Dorling, SR .
ATMOSPHERIC ENVIRONMENT, 1998, 32 (14-15) :2627-2636
[8]   Neural network modelling and prediction of hourly NOx and NO2 concentrations in urban air in London [J].
Gardner, MW ;
Dorling, SR .
ATMOSPHERIC ENVIRONMENT, 1999, 33 (05) :709-719
[9]  
GARDNER MW, 1999, ATMOS ENVIRON, V34, P21
[10]  
GRIEG AJE, 2000, P 8 INT C AIR POLL N, P89