Fish recruitment prediction, using robust supervised classification methods

被引:60
作者
Fernandes, Jose A. [1 ,2 ]
Irigoien, Xabier [1 ]
Goikoetxea, Nerea [1 ]
Lozano, Jose A. [2 ]
Inza, Inaki [2 ]
Perez, Aritz [2 ]
Bode, Antonio [3 ]
机构
[1] AZTI Tecnalia, Div Marine Res, E-20110 Pasaia, Spain
[2] Univ Basque Country, Dept Comp Sci & AI, ISG, E-20018 San Sebastian, Spain
[3] Inst Espanol Oceanog, Ctr Oceanog A Coruna, E-15080 La Coruna, Spain
关键词
Supervised classification; Ecological modelling; Fish recruitment; Discretization; Feature selection; Climate; Anchovy; Hake; ANCHOVY ENGRAULIS-ENCRASICOLUS; BAYESIAN NETWORKS; BISCAY ANCHOVY; BAY; ENVIRONMENT; CLIMATE; MODEL; DISCRETIZATION; VARIABILITY; SARDINE;
D O I
10.1016/j.ecolmodel.2009.09.020
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Improving our ability to predict recruitment is a key element in fisheries management. However, the interactions between population dynamics and different environmental factors are complex and often non-linear, making it difficult to produce robust predictions. 'Machine-learning' techniques (in particular, supervised classification methods) have been proposed as useful tools, to overcome such difficulties. In this study, a methodology is proposed to build a robust classifier for fish recruitment prediction with sparse and noisy data. The methodology consists of 4 steps: (1) a semi-automated recruitment discretization method; (2) supervised discretization of predictors; (3) multivariate and non-redundant predictors selection: (4) learning a probabilistic classifier. in terms of fisheries management, the classifier estimated performance has important consequences and, to be useful, the manager needs to know the risk that is being taken when using this number. Probabilistic classifiers such as 'naive Bayes', have the advantage that, in addition to the predictions, estimate also the probability of each possible outcome. Anchovy (Engraulis encrasicolus) and hake (Merluccius merluccius) recruitments are used as application examples. 'Two-intervals' recruitment discretization accomplishes 70% accuracies and Brier scores of around 0.10, for both anchovy and hake recruitment. In comparison, 'three-intervals' recruitment discretization accomplishes 50% accuracies; and Brier scores of around 0.25 for anchovy and 0.30 for hake recruitment. These statistics are the result of validating not only the classifier, but also the previous steps, as a whole methodology. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:338 / 352
页数:15
相关论文
共 94 条
[81]   A review of feature selection techniques in bioinformatics [J].
Saeys, Yvan ;
Inza, Inaki ;
Larranaga, Pedro .
BIOINFORMATICS, 2007, 23 (19) :2507-2517
[82]   Interannual changes in sablefish (Anoplopoma fimbria) recruitment in relation to oceanographic conditions within the California Current System [J].
Schirripa, MJ ;
Colbert, JJ .
FISHERIES OCEANOGRAPHY, 2006, 15 (01) :25-36
[83]  
Sebastiani P, 2005, DATA MINING AND KNOWLEDGE DISCOVERY HANDBOOK, P193, DOI 10.1007/0-387-25465-X_10
[84]  
SHANNON CE, 1948, BELL SYST TECH J, V27, P379, DOI DOI 10.1002/J.1538-7305.1948.TB01338.X
[85]   A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis [J].
Statnikov, A ;
Aliferis, CF ;
Tsamardinos, I ;
Hardin, D ;
Levy, S .
BIOINFORMATICS, 2005, 21 (05) :631-643
[86]   CROSS-VALIDATORY CHOICE AND ASSESSMENT OF STATISTICAL PREDICTIONS [J].
STONE, M .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1974, 36 (02) :111-147
[87]  
TORGO L, 1997, LECT NOTES COMPUT SC, P266
[88]   Advantages and challenges of Bayesian networks in environmental modelling [J].
Uusitalo, Laura .
ECOLOGICAL MODELLING, 2007, 203 (3-4) :312-318
[89]  
Van der Gaag L. C., 2001, P 13 BELG NETH C ART, P109
[90]  
Witten I.H., 2005, Data Mining: Practical machine learning tools and techniques, V2nd