An approach for determining relative input parameter importance and significance in artificial neural networks

被引:51
作者
Kemp, Stanley J.
Zaradic, Patricia
Hansen, Frank
机构
[1] Univ Penn, Leidy Labs, Dept Biol, Philadelphia, PA 19104 USA
[2] Stroud Water Res Ctr, Avondale, PA 19311 USA
关键词
ANN; simulation; artificial neural network; parameter importance; virtual ecology;
D O I
10.1016/j.ecolmodel.2007.01.009
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Artificial neural network (ANN) models are powerful statistical tools which are increasingly used in modeling complex ecological systems. For interpretation of ANN models, a means of evaluating how systemic parameters contribute to model output is essential. Developing a robust, systematic method for interpreting ANN models is the subject of much current research. We propose a method using sequential randomization of input parameters to determine the relative proportion to which each input variable contributes to the predictive ability of the ANN model (termed the holdback input randomization method or HIPR method). Validity of the method was assessed using a simulated data set in which the relationship between input parameters and output parameters were completely known. Simulated data sets were generated with known linear, nonlinear, and collinear relationships. The HIPR method was performed repetitively on ANN models trained on these data sets. The method was successful in predicting rank order of importance on all data sets, performing as well as or better than the recently proposed connectivity weight method. one main advantage of using this method relative to others is that results can be obtained without making assumptions regarding the architecture of the ANN model used. These results also serve to illustrate the consistency and information content of ANN models in general, and highlight their potential use in exploring ecological relationships. The HIPR method is a robust, simple, general procedure for interpreting complex ecological systems as captured by ANN models. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:326 / 334
页数:9
相关论文
共 13 条
[1]   Artificial neural networks: fundamentals, computing, design, and application [J].
Basheer, IA ;
Hajmeer, M .
JOURNAL OF MICROBIOLOGICAL METHODS, 2000, 43 (01) :3-31
[2]   Review and comparison of methods to study the contribution of variables in artificial neural network models [J].
Gevrey, M ;
Dimopoulos, L ;
Lek, S .
ECOLOGICAL MODELLING, 2003, 160 (03) :249-264
[3]   Individual-based modelling and ecological theory: synthesis of a workshop [J].
Grimm, V ;
Wyszomirski, T ;
Aikman, D ;
Uchmanski, J .
ECOLOGICAL MODELLING, 1999, 115 (2-3) :275-282
[4]   Energy availability and habitat heterogeneity predict global riverine fish diversity [J].
Guégan, JF ;
Lek, S ;
Oberdorff, T .
NATURE, 1998, 391 (6665) :382-384
[5]  
Hastie T., 2009, The Elements of Statistical Learning, P9
[6]   Application of neural networks to modelling nonlinear relationships in ecology [J].
Lek, S ;
Delacoste, M ;
Baran, P ;
Dimopoulos, I ;
Lauga, J ;
Aulagnier, S .
ECOLOGICAL MODELLING, 1996, 90 (01) :39-52
[7]  
Lek S, 2000, ENVIRONM SCI, P3
[8]   An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data [J].
Olden, JD ;
Joy, MK ;
Death, RG .
ECOLOGICAL MODELLING, 2004, 178 (3-4) :389-397
[9]   Illuminating the "black box": a randomization approach for understanding variable contributions in artificial neural networks [J].
Olden, JD ;
Jackson, DA .
ECOLOGICAL MODELLING, 2002, 154 (1-2) :135-150
[10]   Developing an empirical model of phytoplankton primary production: a neural network case study [J].
Scardi, M ;
Harding, LW .
ECOLOGICAL MODELLING, 1999, 120 (2-3) :213-223