Simplifying artificial neural network models of river basin behaviour by an automated procedure for input variable selection

被引:22
作者
Oliveira, Guilherme G. [1 ]
Pedrollo, Olavo C. [1 ]
Castro, Nilza M. R. [1 ]
机构
[1] Univ Fed Rio Grande do Sul, Inst Hydraul Res, BR-91501970 Porto Alegre, RS, Brazil
关键词
Relative contribution index (RCI); Nested drainage basins; Hydrological simulation; Effect of scale; RUNOFF; PREDICTION;
D O I
10.1016/j.engappai.2015.01.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The objective of the present work is to present a simplified and automated method for identifying and excluding unnecessary input variables, with a consequent reduction in dimensionality of ANN-based hydrological models. The proposed method is iterative and computationally efficient: it consists of perturbing the input variables, recording the change in model performance, establishing an index showing the contribution of each variable to the ANN (the relative contribution index, RCI) and excluding the least-influential variables that fall below a threshold. The method was used to simulate mean daily flow for a 20-year period 1989-2009 from four drainage basins nested at different scales ranging from 19.4 km(2) to 9426 km, in the Southern Brazil. The main result of this method of simplifying ANN-based hydrological models was to increase the Nash-Sutcliffe (NS) coefficient and to reduce RMSE in all the simulations undertaken. The potential of ANN models was therefore improved by eliminating unnecessary and/or redundant variables. Simulating the intermediate basin with area 5414 km(2) (Santo Angelo), for example, the initial performance (12 inputs; NS= 0.894) improved when a simpler and more parsimonious model was used (4 inputs; NS= 0.944). To validate the simplification procedure, a comparison was made between the proposed method (RCI) and the well-known methods of Overall Connection Weights (OCW) and Forward Stepwise Addition (FSA). For the comparison between RCI and OCW methods, in most cases, the ordering of selected variables was similar, confirming that the two procedures satisfactorily identify the more important variables, although the RCI is computationally more efficient giving a small advantage in the resulting model performance. In the FSA method, although the performance of the obtained models has also been satisfactory, the computational effort was much greater than with the other two methods because of the excessive number of the neural network training performed (117 training procedures in Combination 2, against only six for the RCI method, for example). (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:47 / 61
页数:15
相关论文
共 48 条
  • [1] NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION
    AKAIKE, H
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) : 716 - 723
  • [2] Alegre P., 1980, PROC HYDROLOGICAL FO, P445
  • [3] [Anonymous], FINAL REPORT ACTIVIT
  • [4] [Anonymous], IAHS PUBLICATION INT
  • [5] [Anonymous], 4 1960 IRE WESCON CO
  • [6] [Anonymous], 1991, INTERPRETING NEURAL
  • [7] [Anonymous], 2013, REV BRAS RECURSOS HI
  • [8] [Anonymous], THESIS FEDERAL U RIO
  • [9] [Anonymous], 17 S BRAS REC HIDR S
  • [10] Bishop CM, 1995, Neural Networks for Pattern Recognition